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On Communication of Analog Data from 
a Bounded Source Space 


By AARON D. WYNER and JACOB ZIV 
(Manuscript received June 5, 1969) 


We consider the problem of the transmission of discrete-time analog data 
with a variety of fidelity criteria. The outputs of the analog source are as- 
sumed to belong to a bounded set. Bounds on the minimum achievable average 
distortion for memoryless sources are derived both for the case where the 
coding delay ts infinite (an extension of the Shannon Theory) and also for 
some cases where the coding delay is finite. Several examples are given, for 
which the upper and lower bounds coincide. 

Further, we discuss the case where the assumption of the existence of a 
probabilistic model for the source is dropped. We adopt as our fidelity 
criterion the supremum over all possible source-output n-sequences x, of 
the conditional expectation of the distortion given x (‘guaranteed distor- 
tion”). The Shannon Theory is not directly applicable in determining the 
minimum guaranteed distortion. We do obtain results for two important 
cases. Some generalizations and applications are also discussed. 


I. INTRODUCTION 


In this paper we are concerned with communication of discrete-time 
analog data over a communication channel with a variety of fidelity 
criteria. The central assumption about the analog source is that its 
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outputs belong to a bounded set, typically the interval [—A/2, A/2]. 
We begin with a rough outline of our results, leaving the precise formula- 
tion and statement to Section II. Proofs are found in Section III. 

Suppose that we have a data source which emits a sequence of sym- 
bols x, , 2%, °:: © X (an arbitrary set) at a rate of ps per second. This 
sequence is fed into an “‘encoder”’ which assigns to each successive block 
of n source symbols, say x = (4, 2, °-* , X,), a channel input of dura- 
tion n/ps = T seconds. At the receiving end of the channel, the T- 
second output is transformed by a ‘decoder’ into an n-sequence, say 
xX = (4,, £2, --- , €,), which is delivered to the destination. The “‘dis- 
tortion” between the source output sequence x and the received sequence 
% is defined as d(x, %) = n7™* >Ot_, d(x, , £,), where d(a, 4) = O is 
an arbitrary function. 

The classical problem is that of a ‘‘memoryless” source, where suc- 
cessive source outputs are statistically independent with identical 
probability distribution. In this case it is meaningful to let the system 
performance criterion (fidelity criterion) be the statistical expectation 
of the distortion d (x, %). A quantity of interest is d*(T), the smallest 
attainable value of the fidelity criterion when the coding delay is T 
seconds. The Shannon Theory gives the asymptotic behavior of d*(T) 
as T — o. In many cases this limit is difficult to evaluate analytically. 
Theorem 1 (in Section 2.2) considers the case where the source output set 
x” = [—A/2, A/2], and the function d(«, £) depends only on the dif- 
ference £ — x. This theorem gives a lower bound on limit... d*(T). 
The examples which follow this theorem illustrate the applicability 
and utility of the bound. 

There are two cases in which we are particularly interested. In the 
first, the source set X = {0,1,--- , K — 1} with a uniform distribution, 
and d(z, £) = 0 or 1 according as x = ¢ or x # #. Thus the fidelity 
criterion is the error-rate. For this case let d*(T) = P.(K, T). In the 
second case, % = [—A/2, A/2] with a uniform distribution, and d(a, £) = 
0 or 1 according as | « — €| < dor|x — £| = 6 (where 6 > 0). In 
this case let d*(T) = Q(T, A, 6). It turns out that P, and Q are inti- — 
mately related. In fact it is a consequence of Theorem 2 (Section 2.2) 
that if A/(26) = K,, an integer, then Q(7, A, 6) = P,(T, Ky). This 
result is valid for all values of the delay parameter 7. From this result 
it can be deduced that the optimal encoder for the analog source X = 
[—A/2, A/2] is a “uniform” quantizer followed by an optimal ‘‘digital” 
encoder. This is the only known case for which analog-to-digital con- 
version is known to be optimal for finite T for the transmission of analog 
data from a memoryless source. 

We now drop the assumption of a memoryless source. In fact we do 
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not even assume that there is a probabilistic model for the source. 
Instead of the expectation of the distortion, we adopt as our fidelity 
criterion, the supremum, over all possible source output n-sequences x, 
of the conditional expectation of the distortion given x. We call this 
criterion the “guaranteed distortion’. Let d*(T) be the minimum 
attainable guaranteed distortion for a system with delay parameter 7’. 
The Shannon Theory is not directly applicable in determining d*(T). 
We do obtain results for the two interesting cases discussed below. 

In the first, 9¢ = {0,1,---,& — 1} and d(z, 4) = Oor 1, respectively, 
when « = éorz ¥ 4. For this case let d*(T) = P.(T, K). It is a conse- 
quence of Theorem 3 (Section 2.3) that limit... P,(T, K) = limitr.. 
P,(T, K), which is known from the Shannon Theory. 

In the second case, X = [—-A/2, A/2] and d(x, #) = 0 or 1, respect- 
ively, when |x — £| < dor|2 — 4| = 4. For this case, let d*(T) = 
Q(T, A, 5). Theorem 4 (Section 2.3) relates P, and Q by 


Q(T, A, 6) = Pr; M), 


where JM is the unique integer satisfying (M — 1) S$ A/(26) < M. 
Here too, we can deduce the optimality of analog-to-digital conversion. 
Theorem 4 is generalized by Theorem 5 (Section 2.4) to apply to an 
arbitrary set 9 with a distance-like measure defined on it (replacing 
|e —)). 

In Section 2.5, we give some applications of the above results. In 
particular we obtain some results for the distortion d(z, 4) = |x — #|’. 

In order to state our results completely and precisely, it is unfor- 
tunately necessary to give a rather large collection of definitions and 
to introduce a large number of symbols. In order to ease the reader’s 
burden somewhat, we have included a glossary of symbols in the ap- 
pendix. 


II. STATEMENT OF THE PROBLEM AND PRINCIPAL RESULTS 


In Section 2.1 we define a ‘‘channel” (and its ‘‘capacity’’) in a very 
general and abstract way. We do this because the nature of the channel 
does not figure explicitly in our results (except for the channel capacity), 
and we want our results to apply as broadly as possible. In Section 2.2 
we describe the communication system which we shall consider, and 
state our results for the case of a “memoryless” information source. 
The remainder of the results follows in Sections 2.3~2.5. 


2.1 Channel and Channel Capacity 


A channel is defined as follows. For every T > 0 we have a set W, 
of “allowable” inputs and a set 37 of possible outputs. Every 7 
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seconds some we W- is transmitted through the channel, and the chan- 
nel output zis a member of 37 . The output is related to the input we W 
by a probability measure yu, on the set 37. Thus given that we W, is 
transmitted, the probability that 22 B [where B is a (measurable) subset 
of 37] is u,(B). For example Wz, and 37 may be the set of binary se- 
quences of length [T]"'. The measure yu, is then a discrete conditional 
probability distribution. Another example is the case where Wy, and 
3, are sets of real valued functions defined on the interval [0, 7], and 
the members of ‘Wy, have “energy” not exceeding PT. 

With T specified, a block code with parameter N is a set of N pairs 
{(w;, B;)}7_,, where w; © Wr are called code words and the 
collection of B; is a set of disjoint (measurable) subsets of 3, called 
decoding sets. If code word w;(1 S$ 7 S N) is transmitted, the resulting 
error probability is 


\; = Pr {z¢ B; | w; is transmitted} = 1 — u,,,(B;). (1) 
The word error probability for the code is 
\ = max 2; . (2) 
1sisN 


Let \*(T, N) be the smallest attainable word error probability for a 
code with parameters T and N. The channel capacity C is defined as 
the supremum of those numbers R 2 0, for which 


A*(T, [e*7).) — 0, as To. 


Let us define the average word error probability by 
oe a 
X= % a Ai. (3) 


Thus X is the resulting average error probability which results when 
each of the N code words are equally likely to be transmitted. Let us 
define \*(7', N) as the smallest attainable value of X for a code with 
parameters 7’ and N. Since X S X for any code, it follows from the 
above definition of channel capacity that for any R < C, 


X*(T, [e*"|)-) > 0, as To ~., 
Further it is known that for a large class of channels including the mem- 
oryless gaussian channel and discrete memoryless channels,” 
x*(T, [e°7]-) 4, as To, (4) 
[It is also true that for many of these same channels if R > C, 


+ Throughout this paper we denote by [x]~ and [z]* the largest integer < z 
and the smallest integer > «x respectively (0 < x < 0). 
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x*(T, [e*7]") tends to 1 as T — , but we do not need this fact here.] 

Let us remark here that for a large class of channels (including “‘mem- 
oryless’ channels and “finite state channels’’), the capacity C is known 
to be the supremum of a quantity called the “information”. In fact 
this equivalence is the essence of the Fundamental Theorem of Informa- 
tion Theory. It will not be necessary, however, to explore this equi- 
valence further. 


2.2 Memoryless Source and Communication With a Fidelity Criterion 


Consider the communication system of Figure 1. The output of the 
source is a sequence of random variables X, , X2, --- from an arbi- 
trary subset 9 of Euclidean p-space. Assume that these random vari- 
ables are statistically independent and identically distributed with 
probability density function Ps(x), x « X. If we allow impulses in the 
density function, then the X, can be discrete random variables. Say 
that the source outputs appear at a rate of pg per second. The encoder 
waits 7 seconds (called the ‘‘delay’”’) during which time n = ps7’ sym- 
bols, say X, , X2,--+ ,X,eX, have appeared at its input. (Assume that 
psi’ is an integer.) Denote the T-second output of the source by the 
random n-vector X = (X,, X2,°-: ,X,) eX”. 

The channel is defined as above (Section 2.1), so that during the 
T seconds which it takes for the n-vector X to appear, the channel can 
process an input belonging to the channel input set W,. It is the task 
of the encoder to assign to each possible source output n-vector X = x, 
a channel input fz(x) e W,. The channel output is a member Z of the 
channel output set 3, , and it is the task of the decoder to assign to 
each possible Z = z an n-vector X = fp(z) e X”. Note that the source 
and channel statistics define a joint probability density on the random 
n-vectors X and X. 

Now ideally we would like X = X. But this is most often not possible 
due to imperfections (for example, noise) in the channel. Thus we define 
a fidelity criterton which we use as a measure of the reliability of the 
system. Suppose we are given a non-negative distortion function 
d(x, £) defined on X X &. Typical choices of the distortion function are 
d(x, 4) = |x — #|*(s > 0) when & is a subset of the reals (that is, the 
dimensionality p = 1), or the “Hamming” distance 


SOURCE ENCODER CHANNEL DECODER 


Fig. 1— Communication system. 


f= (X) X=fp(z) 
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d(x, #) = dy(e, 4) = , aie (5) 
1, x &, 
where & is a discrete (that is, countable) set. 
The distortion between the n-vectors, x = (2, %2, °** , %,) and 
x = (41 , @e, ee 8 yx) 18 


d(x, x) = n » d(x, ; £,). 
1 


Our system performance (fidelity) criterion, which we seek to minimize, 
is 


d = Ea” (X, %), 


where £ denotes expectation (with respect to the joint probability dis- 
tribution of X and X). For a given delay 7, which corresponds to 
n = psT, let d*(T) denote the infimum (with respect to all encoder- 
decoder pairs) of the attainable values of d (for given ps and source- 
channel statistics). Although we usually do not know d*(T) exactly, we 
do know its asymptotic behavior as T — ©. We proceed as follows. 

For 0 S B S ~, define M%(8) as the set of probability density func- 
tions p(x, £) defined on X& X 9X which satisfy 


(t) fe p(x, ) dé = Ps(x), the source output probability density 
function, 
(27) Ju Sa d(x, £) p(x, £) dx dé S 8B. 


The znformation corresponding to the density p(x, £) ¢ NT(8) is defined as 


ayy 4 p(x, £) A 
Tipe, )) = | | pe, 4) log pea de at, 6) 
where p.(é) = Jx p(x, £) dx. It is easy to show that J = 0 with equality 
if and only if p(x, £4) = P,(x)p.(é). Finally define the equivalent rate 
of the source 
ReA(8) = inf I{p(, 2}. (7) 
p(z,£) eI (B) 
R..,(8) is usually called the ‘‘rate-distortion function”. Note that R.,(8) 
depends only on 8 and P,(#). 
Let us now return to the quantity d*(7’). Shannon’s well known 
theorems tell us the following.” For a given communication system 
(as in Fig. 1), 
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(@) d(T) =d, ~~ forall 7, 8) 
(2) d(T) ~>d,, as Too, 
where d, is the smallest solution of 
pReq(do) S C, 


and C is the capacity of the channel. 

Some intuitive insight into the meaning of Shannon’s theorem can be 
gained by thinking of p,R,,(@) as the equivalent rate in nats per second 
of the source (when reproduced with distortion 8). It is reasonable 
then to suppose that the minimum attainable distortion d, is that 
distortion for which the source rate is just equal to the channel capacity 
Ce 

There are two well-known cases for which R,,(8) is known explicitly. 
The first is the case where X = the reals, Ps(x) = (27)? exp (—2”/20’), 
and d(x, £) = (« — #)*. In this case, R.,(8) = 3 log «”/8’, so that dy = 
a” exp (—2C/ps). 

The second case (which is important in the sequel) is&X¢ = {0,1,2,---, 
K — 1} (K = 2,3, +++), Ps(v) = D 423 (1/K)6(x — k)[6(x) is the unit 
impulse], and d(z, £) is given by equation (5). In other words, the source 
output is a sequence of independent random variables, each equally 
distributed on the K-ary alphabet {0, 1, --- , K — 1}. The quantity 
d is the average fraction of symbols received in error, and is often called 
the “error-rate’”’. In this case, we write d*(7) = P,(T, K), where the 
dependence of P, on K as well as T7' is indicated explicitly. For this 
case it is known that” 





logK —h@)—Blog(K—1), BS 
R.q(8) = (9a) 


0, B2 





where 
h(6) = —Blog8— (1 — 8) log (1— 8), OS6B 1). (9b) 


Shannon’s theorem, equation (8), tells us that 


eal; KK) V(K, ps , C), T- o, (10a) 
where 7(K, ps , C) is the smallest solution of 
pR.q(7) =, (10b) 


and C is the channel capacity. A graph of y(K, ps , C) versus C'/Szg for 
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1.0 


0.75 f 


— 


0.5 


¥(K,ps,C) 


0.25 


C/p, 


Fig. 2—(K, ps, C) versus C/ps (K-a parameter). 


various values of K is given in Fig. 2. Notice that y(K, ps , C) decreases 
from ( — 1)/K to zero as C/ps increases from zero to log K. 

Let us also remark that the quantity P,(T, K) is related to \*(T, MM) 
(the smallest attainable average word error probability). In fact it is 
easy to show that 


> ae, K") < PT, K) < ¥*(T, K’) (11) 


where n = psT (assumed to be an integer). 

Now, in the general case [arbitrary Ps(x) and d(z, #)], it is usually 
not possible to obtain a closed form expression for R,,(8). Theorem 1, 
which is stated below, gives a useful bound on R,,(8) for the case where 
Ps(a) is a density and X is a bounded set. This theorem is an extension 
of a result of Shannon.” The proof is given in Section 3.1. 

Let X be the interval [—A/2, A/2], where A(O < A < o&) is arbi- 
trary. Let the source outputs X have density Ps(x), and let d(z, ¢) = 
r(x — £), where r(u) satisfies 


(2) ru) = r(—u), 
(it) r(u) = 0, with equality at u = 0, (12) 
(iit) r(u) is continuous at u = 0. 
Then it can be shown (see Ref. 3, Appendix A) that forO < 6 
1/A f47,. r(u) du, there exists a unique (8) which satisfies 


A/2 A/2 
i r(uje Pr du fos a | ghee du. (13) 


—A/2 A/2 
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Define the probability density gg(x) on X by 


A/2 -1 
ga(2) — If - ee (A) (a) ae | ge. (14a) 


[note that f r(x)gg(x) dx = 6], and let 


H,(6) = — J an(0) log gol) de (14b) 


be the corresponding ‘‘entropy’’. 

For A = ©, equation (13) has a solution in many cases. In particular, 
when 7(u) = | u |* (s > 0), equation (10) has a solution for0 < B < ~. 
Thus g,(x) and H,(@) are meaningful for A = © also. 

We now state the lower bound on R,,(8) as Theorem 1. 


Theorem 1: For the source defined above, for0 < B S A f42,. r(u) du, 
R.(8) 2 Hs — H,(8), (15a) 


where 
A/2 
His | P(x) log Pg(x) dx (15b) 
~-A/2 


is the entropy of the source density P s(x), and H,(8) ts defined in equations 
(13) and (14). Inequality (15a) also holds for A = ©, when r(u) = |u|’ 
(s > 0). 


Examples: 


(<) Say %& = the reals, and d(z, 4) = r(x — #) = |x — #|’, where 
s > Ois arbitrary. Theorem | is applicable with A = «. Solving equa- 
tion (13), yields \,(8) = (s@)~* and 


(s~1)/s 


gs(0) = ——~ exp [=| 2 |'/(68)], 
2ar(’) 
so that 
R.g(8) 2 Hs — H,(8), (16a) 
where 
1 aer(*)s 
H,(8) = = log | —j=1— |, (16b) 


and Hs is given by equation (15b). 
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(ii) Quadratic Distortion: Let 9 = the reals, and d(z, #) = (x — 4)’. 
Then from example (2), with s = 2, 


R.q(8) 2 Hs — 3 log 2mep. (17a) 


Further Shannon* has given the following upper bound to R.,(8): 


R.,(8) S 4 log 5 , BSo’, (17b) 


where o” = f 2’Ps(x) dx. Note that when P(x) = (240*)7? exp (—2’/2), 
the upper and lower bounds of inequalities (17) coincide for B S o’. 
[Since R,,(8) is non-increasing, R.,(8) = 0 for B > o*.] 

Another case of interest is % = [—A/2, A/2](A < @), Ps(x) = A’, 
and d(x, 4) = (« — #)”. In this case Theorem 1 (applied for finite A) 
provides a lower bound on R,,(8) which is tighter than that of inequality 
(17a) and can be evaluated numerically. An upper bound can be found 
by computing I[p.(x, €)], where p.(#, £), a joint probability density 
for X and X, is defined by the following: The variate X has density 
P s(x). The variate X = a(X + Y), where the Y is a Gaussian variate, 
independent of X, with 


EY =0 and EY’ = BA’/(A? — 128), 
and 
a = (A? — 128)/A’. 


Note that E(X — X)? = B. The information I[po(x, £)] corresponding 
to p(x, ) can also be evaluated numerically and is an upper bound to 
R., (6). Figure 3 is a graph of these bounds on R,,(8), and also of dy , 
the solution of psR.,(do) = C. 


(wii) Say X% = [—A/2, A/2]. Let Ps(x) = A~* and d(a, #) = r(x — 4) 
where, in addition to satisfying conditions (12), r(z) satisfies 


ru) = rv) if w=v (mod A). (18) 


[If, for example, A = 27 and & represents an angle, then equation (18) 
must hold.] For r(u) satisfying condition (18), the bound (15a) on 
R.4(8) of Theorem 1 holds with equality, namely, R., = Hs — H, (8). 
(Section 3.1) 

(iv) Threshold Distortion: Let X© = [—A/2, A/2] and let d(a, #) 
be the “threshold” distortion defined by 


d(x, #) = diz, €) = riz — 4), (19a) 
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ie) 0.4 0.8 12 1.6 2.0 24 2.8 





C/p,, R(B) 
Fig. 3— Bounds on 8/A? versus Req(8) or do/A? versus C'/ps. (7) upper 
aoa it) — — — lower bound (Theorem 1); (wi) — - — - — lower bound 
17a 
where 
= 
r;(u) = 4 | u | = 4, (19b) 
0, jul < 6. 


In this case, the bound (15) of Theorem 1 is 
Rea(8) 2 Hs — h(8) — log 26 — B log (A/26 — 1), (20) 


where h(@) is defined in equation (9b). There is a case where inequality 
(20) is satisfied with equality, namely Ps(x) = A™’* and A/(26) = 
K, = 1, 2, --- . For this case, we show in Section 3.1 that 


nae 





foe K, — h(@) — B log (K, — 1), 0 =6:= a 
R.,(8) = | 


0, pe 
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Notice the striking similarity of equations (21) and (9) for the discrete 
K-ary source. We will have more to say about this later. 

When A/(26) is not an integer, we show (in Section 3.1) the right 
member of (21) is an upper bound to R,,(8) with Ky replaced by 
[A/26]* = K, . Thus with inequality (20), 


log |4| — h(8) — B log |4 — | < R.,(8) 


< log K, — h(8) — B log (K, — 1). (22) 


A Result for Finite 7’ for the Threshold Distortion is as follows. 
Let Xx = [—A/2, A/2], Ps(x) = A’, and d(z, 4) = d;(z, £), the thresh- 
old distortion given in equations (19), as in example (2v) above. In the 
system of Fig. 1, let d*(T) = Q(T, A, 6), where the dependence on A 
and 6 is indicated explicitly. The results in example (7v) [equation (21)] 
and equation (10) imply that for 4/25 = K,, lim... Q(T, A, 6) = 
limri.o P.(T, Ko) = y(Ko, ps , C). This correspondence between Q and 
P, is extended to finite 7 in the Theorem 2 (proved in Section 3.2.). 


Theorem 2: Let K, = [A/26]*, K = [A/26]-. For all T, 
Pt, Key SOT, 4,9) 2 Fo K,): (23) 


The quantities P, and Q are defined, of course, for the same channel and 
source output rate ps. 

A case of particular interest is A/26 = K,, an integer, so that K, = 
K_ = K, and Theorem 2 yields 


PAT, Ko) = Q(T, A, 6), all T. (24) 


For this case we deduce from equation (24) that (for all 7’) the optimal 
encoder for the analog source is a K,-level “uniform” quantizer with 
quantization levels [(2i — Ky — 1)6]*2, followed by an optimal “digital” 
encoder. This is the only known case for which analog-to-digital con- 
version is known to be optimal for JT’ < o for the transmission of analog 
data. 


2.3 Case Where The Source Has No Statistics 


Suppose that the source output is, as in Section 2.2, a sequence of 
symbols from the source alphabet X, which appear at a rate of ps per 
second. However, in this case, as distinct from above, we assume that 
there is no known statistical model for the source. Say that, as in Sec- 
tion 2.2, the encoder waits 7’ seconds during which time n = ps7 source 
symbols x = (2, %2, --- , 2p) © X” have appeared. Again, as above, 
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the encoder output is f(x) e W,, the channel output is Z « 37 and the 
decoder output is X = fp(Z) 2 X”. The encoder-decoder pair and the 
channel statistics induce a probability density for X on X” which de- 
pends on x (the source output). Denote this density by f(% | x). As- 
suming, as in Section 2.3, that a non-negative distortion function 
d(x, €) on X X & is given, then the average distortion when the source 


output is x is 
a = [ [2 Sam, 49 |e 


where x = (2, ,%,°':,%,) and & = (4,,4.,--- , ,). Since we cannot 
take a meaningful statistical average over x, we adopt as our fidelity 
criterion, the ‘‘guaranteed’”’ distortion 
d = sup d(x). (26) 
xeX” 
Let d*(T) be the smallest attainable value of d for a given delay T 
(which corresponds to n = ps’). 

For the special case where X = {0, 1, --- , K — 1} and d(a, £) = 
di(z, £) [given by equations (5)] let d*(T) = P.(T, K) where the de- 
pendence on K is made explicit. Consider P,(T, K) (the average error- 
rate in Section 2.2). Clearly, 


PAT, K) = P.(T, K). 
The following theorem [taken together with equations (10)] shows that 


as T — o, P, and P, are asymptotically equal. The proof is in Section 
3.4. 


x) dx, (25) 





Theorem 8: For the communication system described above with K-ary 
source alphabet, source output rate ps, and channel capacity C, 
limit Po, K) ae yV(K, Ps ; C), (27) 
T3% 
where y(K, ps, C) is given by inequality (10b). 

A second important special case is &% = [—A/2, A/2], and d(x, £) = 
d;(«, £), the threshold distortion given by equations (19). In this case 
let d*(T) = Q(T, 6, A). The quantity Q can be related to P, , and 
Theorem 4 (proved in Section 3.3) is analogous to Theorem 2, though 
somewhat sharper. 


Theorem 4: For 0 < 6 S A, let M(6) be the integer satisfying 


M-1s a <M. (28a) 
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Then for all T, 
Q(T, A, 6) = P[T, M (8)]. (28b) 


The quantities P, and Q are defined, of course, for the same channel 
and source output rate pg . 

In constrast to Theorem 2, this theorem asserts the equality of cor- 
responding values of Q and P, for all values of A/(26). Also as in Theo- 
rem 2, this theorem implies that the optimal encoder for the source 
x = [—A/2, A/2], with d = d; [with a fidelity criterion as in equation 
(26)] is a uniform quantizer [with M (6) levels] followed by an optimal 
digital encoder (see part (2) of the proof of Theorem 4). 

Theorems 3 and 4 can be combined to obtain the following. 


Corollary: For0 < 6 S A, let M(6) be as in Theorem 4. Then 


lim QT, A, ) = y[M(65), Ps; C, (29) 


T—00 
where y is given by inequality (10b). 
2.4 Generalization to Arbitrary Source Alphabets 


In this section we consider the case where the source alphabet x 
is an arbitrary space with an arbitrary metric or metric-like function 
defined on it. We then give a generalization of Theorem 4. First we give 
some preliminary definitions. 

Let X& be a set and let p,(z, #) be real-valued function defined on 
x X X with the properties 


(t) po(t, £) = po(#, 2) (30a) 
(t7) po(x, £) = O with equality when x = ¢@. (30b) 

If in addition po(z, #) satisfies 
(it) pole, €) S pol, y) + poly, 4), (30c) 


then p)(z, #) is a metric; but we will not require inequality (30c) to 
hold. For ze X& and A > 0, let S,(A) = {€2%X : p(w, 4) < A} be the 
(open) sphere of radius A about x. 

Aset A C is called a “A-covering” (of ) if \U,.4 S,(A) contains x, 
and A is called a “‘A-packing” (of 9) if S,(A) (\ S;(A) is empty for all 
x, 2A, ~ £. Let M-(A) be the minimum number of points which can 
constitute a A-covering of X, and let M/p(A) be the maximum number 
of points which can constitute a A-packing. These quantities are related 
by the following lemma (proved in Section 3.4). 
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Lemma 1: Let = sups,y.20x po(X, Y)/[po(X, 2) + po(Z, y)]. Then for A>0, 
M ¢(2mA) Ss M>(A). (31) 


In particular, if p) is a metric, 7 S 1. Inequality (31) is of course 
meaningful only if 7 < ©. 

Now consider the communication system discussed in Section 2.3 
with an arbitrary source space Xt. Let po satisfy expressions (30a) and 
(80b), and define the ‘“‘threshold”’ distortion d;(x, #) by 


Ly polx, £) = 6, 


0, polk, #) < 6. 
Let d be the guaranteed distortion defined by equation (26) with the 
distortion d(x, 4) = d;(x, #) [given by equation (82)]. Finally, let 
G (T, 5) be the smallest attainable value of d for a system with delay 7. 
(The dependence of G on 4 is made explicit.) Of course G(T, 6) also de- 
pends on pg as well as the channel characteristics. The special case 
treated in Section 2.3 is X = [—A/2, A/2], p(x, #) = |x — #|. In 
this case G(T, 6) = Q(T, A, 8). 

The following is a generalization of Theorem 4 and is proved in Sec- 
tion 3.3. 


Theorem 5: Let M,;(A) and M>p(A) be as defined above for the source 
alphabet % [with a po(x, X)]. Then G(T, 6) satisfies 

PIT, M>(8)] S$ G(T, 8) S PIT, Me(6)], (33) 
where P, is defined in Section 2.3. Note that P, and G are defined for the 
same channel and source outpul rate ps. 


Theorem 5 reduces to Theorem 4 on noting that for & = [—A/2, A/2] 
and p)(v, £) = | w— ¢|, 


M>(6) = M¢(8) = M(), (34) 


where J (6) is defined by inequality (28a). Let us remark that although 
Mp = Mz, the maximum 56-packing is not in general identical to their 
minimum 6-covering. For example, when 6 = A/4, M(6) = 3, and the 
maximum 6-packing is unique, namely 


A A 
\-4 , 0, Al ’ 


* To be precise, we must assume that the space X and the encoder and decoder 
functions are measurable. 
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which is not a 6-covering. There are many 6-coverings, for example 


A A 
{- 2° 0, 3 \ 
2.5 Some Applications 


2.5.1 Rate at Which Q(T, A, 6) Approaches its Limit 


Consider again the source with © = [—A/2, A/2], Ps(x) = A’, 
and distortion d(x, £4) = d;(x, #) [defined by expressions (19)]. Suppose 
further that A/(26) = K,, an integer and that the channel capacity 
C = ps log K,. In this case y(Ky, ps , C) = 0 [see expressions (9) and 
(10)], so that from expressions (24) and (10a) 


lim Q(T, A, 6) = 0. 
T0 
We will now obtain a lower bound on the rate at which this limit is 
approached. From the first inequality in inequality (11), using n = ps7’, 


PT, K) = a HHT, 7). (35) 
psi’ 


Yor those channels for which expression (4) holds, the right member of 
inequality of (85) ~ (2ps7)~*. Combining expressions (24) and (85) 
we have that 


QP, A, 8) = s~zO +e), (36) 

Ps 
where £(7’) > 0 as T — o. Thus for the class of channels for which 
expression (4) holds and these parameter values, Q(7', A, 6), approaches 
its limit no faster than 7~*. Determination of the similar bounds on the 
rate of approach of Q to its limit for other parameters is an open question. 


2.5.2 The sth-Mean Distortion 


Consider the case where © = [—A/2, A/2], and the distor- 
tion d(x, xz) = |x — #|* (s > 0). When Ps(z) = A™, let the smallest 
attainable average distortion d*(T) * #(T). For the case of no source 
statistics (as in Section 2.3), let the smallest attainable guaranteed 
distortion d*(T) & #(T). We establish some properties of @ and é 
below. 

For any random variable Y (such that | Y | S$ A), and any 6, , 
5.10 S 6,, & S A), 
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i Pri] yY |S 6} sk|Y |= & Peril) yy] <3} 
+ A*Pr {| Y | = 6}. (37) 
It follows from inequality (37) that for arbitrary 6, , 6. (0 S 6,, 6. S A), 
Q(T, A, 6) S&) S &ll — QV, A, &)) + AQT, A, 5), 
(38a) 
and 
QM, A, i) S$ e(P) S ofl — QM, A, &)] + A°Q(Z, A, 62), (88b) 
where Q and @ are defined in Sections 2.2 and 2.3 respectively. Applica- 
tions of Theorems 2 and 4 (and Q, Q = 0) yields 
6P(T, K_-) S$ #7) Ss 6+ A’P.AT, K,), (39a) 
and 
&P[T, M(6)] S$ e(T) S & + APT, M(5.)], (39b) 


where K, = [A/26,]*, K_ = [A/26,]”, and M(6) is defined by inequality 
(28a). Thus é and é’ too are related to the digital error rates P, and P,. 
Of course, 6, and 6, may be chosen to yield the tightest bounds. 


Examples 


(7) Since we know the asymptotic value of P, and P, as T > ~, 
we can apply inequalities (39) to obtain estimates of the limiting values 
é = limit; @(T) and @ = limit,_.é (7). For example, when the 
channel capacity C is large, setting A/26, = exp [(C/ps)(1 + A,)] 
and A/26. = exp [(C/ps)(1 — A,)](A; , A, > 0), yields, after some 
computation, 


j= em{— Ca tec}, (40a) 


a 
Cc 
l 


é = exp \-2 [1 + xo ; (40b) 
where é, , & —-0as C — o. Thus for large C, € and € decay roughly 
exponentially in C. 

Let us remark that parts of inequalities (40) are obtainable by other 
means. Specifically, € 2 K,(s) exp [—sC/ps]| follows from inequality 
(16). Further, «| < exp [—(sC/ps)(1 + &)] and @ S exp [—(sC/ps) 


-(1 + &,)] can be deduced from the work of Panter and Dite on quanti- 
zation,’ Finally the bound é¢ = exp [—(sC/ps)(1 + £)] is new. 
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(it) In this example, we apply the first inequality of (89a) to show the 
possible gains (with the sth mean criterion) obtainable by using coding 
in a particular (though quite typical) case. 

Suppose that the channel is the additive white Gaussian noise channel 
with average power P, , one-sided spectral density N, , with no band- 
width constraint.° To begin with, suppose 7’ = 1/ps, so that n = 1 
and there is no “‘coding’’, that is, each T-second channel input depends 
on exactly one source output. When the source is the K-ary digital source 
(with equi-distributed symbols), it is known that the minimum attain- 
able error rate is lower bounded byt 


: KPor. 3 
ma = aa \ ey) 


Ot we. / "edu, 


P.(T, K) 


IV 


where 


is the cumulative error function. 

We now apply the lower bound of inequality (39a) together with in- 
equality (41) to obtain a lower bound on é (7) when the channel 
signal power P, made large, while T = 1/ps is held fixed. Setting 
6, = P>', we obtain from inequalities (89a) and (41) and (a) ~ 
(27a”)"* exp (—a’/2) (as a > ©), that (with 7 = p;! held fixed) 





a oe Po 
em) =¢(1)2en{- +e} a 
where £;(Po) ~0 as Po > ~. 

Now suppose that for a given channel (and a given P,) we allow T 
to become large. In other words, we permit ‘‘source coding” in blocks of 
length n = pgT’. Since the channel capacity C = P,/N» , we have from 
equation (40a) that 


limit é(T) = & = exp es [1 + Ecol) ) (43) 





where &(P,) > 0 as Py) > &. 
Now let 6 > 0 be arbitrary, and let P, be sufficiently large so that 
for P) 2 P,, 
| (Po) |, | £a(Po) | < 4. 
Then from inequality (42), with Pp) 2 P, , the best attainable mean sth 


_ | This bound follows from Ref. 1 [equation (82)] when the signal energy nP 
in that reference is replaced by PoT7' our signal energy, and M is replaced by K. 
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error with no coding is bounded by 
fil | P5 | 
=) > —_ é 4 
é (4 => exp INops (1 + 8) (44) 


The best attainable sth error with infinite delay T is from equation 
(43) with P, = P, , bounded by 


; é < exp | - Wa ee |. (45) 








We conclude that coding with large delay offers a saving of at least a 
factor of (2s) in power P, or rate ps (when Py = P,). This of course is 
interesting when s > 4. Similar results for s = 2 have been derived by 
Ziv and Zakai.’ This result can be generalized to arbitrary n (here we 
studied 2 = 1) and arbitrary channels simply by using appropriate 
bounds on P,(T, K). 


III. PROOFS OF THEOREMS 
3.1 Proof of Theorem 1 and Related Examples 


3.1.1 Proof of Theorem 1 


Shannon [Ref., 2, pp. 155-156] has shown that for a difference dis- 
tortion measure d(x, £) = r(x — 4), that 


F.a(8) = Hs ae &(8), (46) 


where Hs is given by equation (15b) and (8) is the maximum attain- 
able entropy H {f(x)} for a probability density f(x) which satisfies 


J r@i@) de s 8. (47) 
The entropy H {f(x)} is defined by 
HY@} = —[ 4@) log 1) ax. (48) 


A trivial modification of Shannon’s argument shows that when X = 
[—A/2, A/2], inequality (46) remains valid if f(x) is further restricted 
to satisfy 


fx) = 0, |2|> “. (49) 


Now the density g(x) [defined by expressions (13) and (14a)] satisfies 
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conditions (47) and (49) and has entropy H,(@) [defined by equation 
(14)]. We prove Theorem 1 by showing that if the density f(x) satisfies 
conditions (47) and (49), then H{f(a)} < H,(8). 

Let us write gg(x) = Be“ where \ = \,(@) and where 


A/2 = 
—Xo 
B= [ e (B) r(x) dx 2 
-A/2 


A/2 
—[ gola) log ga(x) ax 
~A/2 


Then 


H, (8) 


A/2 

—log B+ < r(x)ge(x) dx = —log B + dB. 
—-A/2 

Since f(x) satisfies condition (47), 


H,(8) 2 —logB +2 [ r(e)f(a) dz 


-| f(x) log Be”? dx = | f(x) log ga(x) dx. 


Thus 


H{f@)) — HG) S—] 4G) log fe) de + J 4(2) log gp(e) ax 


= -f. f(x) log 2 ie a dx < i jo)| #2 = | en ee ee 


where the second inequality follows from log u S u — 1. Theorem 1 
follows. 

Note that Theorem 1 will hold for A = © as long as we can find 
gg(). Examination of the derivation which establishes the existence of 
g(x) (Ref. 3, Appendix A) shows that Theorem 1 is valid in particular 
for A = © and r(x) = |2|’,s > 0. 





3.1.2 Determination of R., (8) in Example (122) 
For % = [—A/2, A/2], Ps(z) = A™’, and d(x, £) = r(x — #£), where 
r(u) satisfies conditions (12); Hs = log A. Theorem 1 implies 
R.q(B) 2 log A — H,(8). (50) 


We now show that if, in addition, r(w) satisfies equation (18), then 
inequality (50) is satisfied with equality. Let X and X be random varia- 
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bles such that the density for X is Ps(x) = A7'(|2| S A/2), and X = 
X + Y where the random variable Y is independent of X and has density 
ga(y) = Be” [defined by equations (13) and (14a)]. The information 
of p(x, £), the joint density for X, X, is 


Hote, #)) = Hie) — [Pst inte 





x)} dx, 


where p2(4) is the density for X, p(4 | x) is the conditional density for 
X given that X = x, and H{ } is the entropy defined in equation (48). 
Now p(z, 2) = Ps(x)p(é| x) = A7’Be™’?-, so that 


A/2 . E£+A/2 
9(£) = AB | ee" F-) dy = AB rere) aire 
—A/2 £-A/2 
when ¢ = 0 this becomes, letting v = u — A and using equation (18) 
A/2 E+A/2 
p(é) = A’B ee" dy + A™'B ena 
£-A/2 A/2 
A/2 £-A/2 
= A'B ee" du + AB gc BY dys 
£-A/2 —A/2 


Hence, since f gs(x) = 1, 
A/2 
p2(2) a 4B | Pasa du ay A, 
-A/2 


For ¢ < 0, asimilar proof yields p.(#) = A~*. Thus H{p.(4)} = log A. 
Further p(é |x) = gs(@ — x), and a similar use of equation (18) yields 
H{p(é | x)} = A,(B), independent of x Thus we conclude that 
I{p(a, £)} = log A — H,(8). Since p(a, £) « NV(B), this and inequality 
(50) imply &.,(8) = log A — H,(6). 


3.1.3 Proof for Example (iv) 


We first verify equation (21) for the case A/(26) = K, , an integer. 
That R,,(8) is greater than or equal to the right member of equation 
(21) follows from inequality (20) (since Hs = log A) and from R,,(8) 
= 0. To show that F,,(8) is less than or equal to the right member of 
expression (21) we produce a density po(x, #) for which I{p (az, £)} 
equals the right member of equation (21). But first we digress to define 
“entropy” for a discrete random variable. 

Consider a discrete probability density f(x) = >); a; 6(« — x;). Then 
the ‘‘discrete entropy of f(x) is defined by 


Ayif(z)\ = — 24 log a; . (51) 
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Now, say that p(x, £) is the probability density for two random vari- 
ables X, X, such that X takes values at a countable number of points. 
Then the marginal density for X, denoted p.(#) and the conditional 
density for X given X = x [denoted p(¢ | x)] are discrete densities. It is 
easy to show that the information can be written 


ple, #)} = Holplé)} — f[ pic)Holpalé |} dz, (52) 


where 7,(x) is the marginal density for X. 

Return now to Example (iv). Let 0 S B S (Ky — 1)/Ko, and let 
Po(x, £) be the density for X, X, where X has density Ps(x) = A~ and 
X has conditional density p.(¢ | x) given as follows. Partition the 
interval [—A/2, A/2] into K, subintervals {7;}*°"' of width 26. Let 2; 
be the midpoint of J; (¢@ = 0,1, 2, --- , Ko — 1). Then for ve J; 


po(é |2) = (1 — 6) (@ — 2) + TeReEY > a(¢ — a). 

In other words, X is an imperfectly quantized version of X. With proba- 
bility (1 — 8), X is the midpoint of the subinterval in which X lies, and 
with probability 6, X is uniformly distributed among the remaining 
(Ky — 1) midpoints. Note that Ps(x) and p,.(4 | x) together determine 
po(z, £), and that po(x, £) ¢ SI(8). 

Further, by symmetry, X is uniformly distributed on the Ky mid- 
points, so that 


H p{ Do2o(£)} = log Ko , 


where 72(4) is the marginal density for X [corresponding to po(z, £)]. 
Also 


Hy{ po(é|x)} = A(B) + B log (Ky — 1), 
independent of x. Thus equation (52) yields 
I{po(x, £)} = log Ko — h(B) — B log (Ky — 1), 


the right member of expression (21). This establishes equation (21) for 
0<86 S (K, — 1)/K.. Since R.,.[(Ko — 1)/Ko] = 0 and R,, (8) is non- 
increasing, we have R,,(8) = 0 for 8B = (Ky — 1)/Ko, establishing 
expression (21). 

It remains to verify the upper bound of expressions (22). But this 
follows immediately on noting that for fixed A and 6, R.,(8) is a decreas- 
ing function of 5. Thus decreasing 6 to 6’ = A/2[A/26]* results in an 
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increase in R,,(8). Since A/(26’) is an integer, we can apply expression 
(21) to obtain the upper bound of expression (22). 


3.2 Proof of Theorem 2 


Theorem 2 relates the attainable distortions for a digital source and 
an analog source when connected to a given channel. The proof is in 
two parts [corresponding to the two inequalities in expression (23)], 
the second of which uses a bounding technique introduced by Ziv and 
Zakai.” 

In part (2) we are given an encoder and decoder for the digital source 
(with appropriate parameters), which when connected to the channel 
as in Fig. 1 results in an average Hamming distortion d = dy . We show 
how to quantize the outputs of the analog source (with appropriate 
parameters) to essentially simulate the digital source. When this 
quantizer is connected to the digital encoder, we show that we attain 
an average distortion for the analog source d; S dy. This leads us 
directly to the second inequality of expression (28). 

In part (iz) we establish the first inequality of expression (23) in an 
essentially dual way. We begin by assuming the existence of an analog 
encoder and decoder. We then show how to modulate the outputs of 
the digital source to virtually simulate the analog source. Unfortunately, 
this is not as easy as the quantization in part (7), and we have to make 
use of an “‘averaging”’ argument in the course of the proof. 


(t) Let us denote by 8, , the analog source whose output is a sequence 
X,, X2, ++: of independent random variables, each uniformly dis- 
tributed on the source space ©, = [—A/2, A/2]. The random variables 
appear at a rate of ps per second. For this source we use the distortion 
d(x, 2) = d;(x, #) defined by equations (19). Assume first that A/(26) = 
K, an integer, and consider the following (uniform) quantizer. Partition 
the interval [—A/2, A/2] into Ky subintervals {7;}3°~' of width (268) 
where 

Te = Ces Chak 7=0,1,---,K, —1, (58a) 
and 


a | (; = Ka) ee eo oe (53b) 


To be precise, the first interval J) should be closed on the left. The quan- 
tizer q is defined by 


A 


q(x) = 4, if wel; (-4 = 2 A). (54) 


IIA 
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Let us now consider the digital source &; whose output is a sequence 
S,, S2, °°- of independent discrete random variables, each uniformly 
distributed on the Ko-ary set X, = {0,1, --- , Ko — 1}. These random 
variables also appear at ps per second. (Note that we use S; instead of 
X, as in Section IJ to distinguish the outputs of $, from those of 8, .) 
Say that the distortion d = d, as defined in equation (5). 

Suppose that $; can be connected with delay T to a channel as in 
Fig. 1 with (digital) encoder f‘” and decoder f”, and average distortion 
dy. Wenow show how to connect the “analog” source 8, to the 
channel [with the help of f¥? and fi] to attain an average distortion 
d; < d,. Consider the system in Fig. 4. In T seconds the output of 


the analog source is an n-vector (n = psT)KX = (X,,--- , X,). The 
“quantizer” output is the n-vectorS = (S,, S2,°-: , S,), where S, = 
q(X,) (k = 1,2, --- ,n). Note that the S, are independent and uniformly 


distributed on {0, 1, --- , Ky — 1}, as are the outputs of the digital 
source $,. The digital encoder and decoder f” and fS” are as given 


above, and the output of the latter is the Ko-ary vector S = 
(S,;, °°: , S,). Thus 


Ea™ (S, S) = di . 
The “converter” output is the n-vector K = (X,, X., --- , X,) where 
x = (28, = Ko + 1)6. 


In other words if S, = 7, then X, is the midpoint (e; + ¢;11)/2 of the 
ith subinterval. Disregarding the case when X, is equal to one of the 
endpoints e; of the subintervals, (an event with zero eta emcee it is 
clear that |X, — X,| = 6 if and only if S, # S, (k = 1,2,---,n). 
Thus 


ds = Eds” (X, %) 


Edy? (8, 8) = dy. 
It follows that 


Q(T, A, 8) < p,(7, Al (55) 


when A/(26) 1s an integer. The second inequality of expression (23) 


DIGITAL | DIGITAL 
euunee ENCODER ENCODER CON- 
! td)  (d) VERTER 
E 9) 





Fig. 4 — An analog communication scheme. 
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follows on noting that Q(T, A, 6) is a nonincreasing function of 6. Thus 
decreasing 6 to 6’ = A/2K., does not result in a decrease in Q(T, A, 4). 
Since A/(26’) is an integer, we can apply inequality (55) to obtain the 
second inequality of expression (23). This completes part (2). 

(272) Let us suppose that the analog source 8, defined in part (2) is 
connected with delay 7 to a channel as in Fig. 1. The T-second source 
output is the n-vector K = (X,, X.,--- , X,) and the decoder output 
is the n-vector KX = (X,, X2,--:: , X,). Say we attain an average dis- 
tortion 


ds = Edi (X, x) 
Letting E[d\” (X, X) | X = x] be the conditional expectation of d\” (X, X) 
given X = x, we can write 


d= | Hfds? |X = x) an dx. (56) 
[-—A/2,A/2]” 


Suppose that (4/26) = K,, an integer. Let us partition the interval 
[—A/2, A/2] into K, subintervals of width 26 as in equations (53). 
Let & be the set of left end-points of these subintervals, that is, 


& = fe }ier'. (57) 


Now consider the n-cube [—A/2, A/2]". Note that the random n-vector 
X is uniformly distributed on this cube. The partition of the interval 
[—A/2, A/2] defines a partition of the n-cube into K* subcubes, each 
the product of n subintervals. Let the members of &” be denoted by the | 
n-vectors —;,7 = 1, --- , KG, and let C; be the corresponding subcube. 
(That is, C; is the product of the subintervals whose left end-points are 
the coordinates of &; .) Then clearly, 


A Ay’. 
| -2 ) y, | —— dX C; ) 
where >, denotes disjoint union. Thus we can rewrite equation (56) as 


K%, 
ise | +5 Bla | X = x] dx 
g=1 0 C; 


Ko 1 
= —- ~~, E[d;” |X = &,; da, 58 
2 Ki ee (28) [ 8 | oe a| Q@ ( ) 
where the second equality follows from the change of variable of integra- 
tion to a = x — &; , and the fact that A = 25K. 
Some insight into what we have done may be gained by considering 
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the special case where Ky = 2 andn = 2. In this case the n-cube 
[—A/2, A/2]" is a square, and there are K? = 2” = 4 members of &" 
denoted £, , & , & , and & . (See Fig. 5.) The subcubes are C, , C. , C3 , 
and C’, as indicated. 

Let us consider now the digital source 8, defined in part (¢) whose 
output is the sequence S, , S., --- . We would like to transmit the out- 
puts of § through a channel (as in Fig. 1) with delay 7, so that the 
source output must be an n-vector (n = psT')S, and the decoder output 
an n-vector S. The fidelity criterion is 


dy = td (S, S). 


Now suppose that we are given an encoder-decoder, f{”, {%’, for the 


analog source §, [for which A/(26) = K,], connected with delay T’, to 
a given channel. Say this encoder-decoder attains an average distortion 
ds. We show that there exists an encoder-decoder for the Ky-ary 
digital source 8; , connected with delay 7’, to the same channel such that 
the average distortion dy < d;. From this we deduce immediately 
that for A/(26) = Ky, 


P.(T, Ko) S Q(T, A, 8). (59) 


The digital encoder is given schematically in Fig. 6a. The analog en- 
coder which we are given is f(x), x e [—A/2, A/2]", and is realized in 
the right box of Fig. 6a. The function of the ‘‘modulator’’ is to assign to 
each n-vector 8 {0, 1, --- , Ko — 1}", a member of [—A/2, A/2]”. This 
is done as follows. Let & be the set defined by equation (57). For se 
{0, 1, 2, : » Ko = 1}, let 








Fig. 5— A digital encoder. 
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Fig. 6— (a) Digital to analog encoder. (b) Analog to digital decoder. (c) Digital 


communication scheme. 
K. 
a) = @8)|s— | 


be the sth member of & Fors = (s,, 82, ++: ,8,) 2 {0,1,--- ,Ko — 1}", 
let 


g”(s) = [9(s:), 9(s2), +++ » g(Sn)]. 
When the input to the modulator is s, its output is 
« + 9 (s), 


where @ = (a1, @2, °°: , @n) & [0, 26]” is a fixed vector. Thus the digital 
encoder is 


fe (s) = fle + g™(s)]. 


The digital decoder is given schematically in Fig. 6b. The left box is 
the analog decoder f; which we are given. Its output < is a real n- 
vector. The right box is a quantizer. When its input is = (4,,---, £,), 
its output is g,(%) = § = (8, --- , 8), where ,(k = 1,2, ---,n)isa 
member of {0, 1, --- , K) — 1} which minimizes | g,(é,) + a, — 4%; |. 

When the digital source 8, is connected to the channel with this en- 
coder-decoder pair, the result is schematized in Fig. 6c. (Upper case 
&’s and S’s are used to signify random variables.) The portion of the 
system in the dotted lines is precisely the analog encoder-channel- 
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decoder which would produce an average distortion ds; given by equa- 
tion (56), 7f the analog input, X, where uniformly distributed on the 
n-cube [—A/2, A/2]”. But this is not the case here. In fact, X takes only 
one of K* possible values. However, the quantity E[d‘(X, X) | X = x] 
is exactly the same in the system of Fig. 6c as in equation (56), for x = 
g(s) + a(se {0, +--+ , Ko — 1}"). 

Let us write an expression for the average distortion d, for the 
digital source. Note that S, ~ 8,, only if |X, — [g(S,) + a]| 2 6. 
Thus 


dip (S, 8) < d&[g™(S) + a, X], 
and 


dy = Ed,(S, S) 


IA 


De BlrK,D|X=9"@) +e, 0) 


where >), is the sum over the K* equally-likely values of s. Let us now 
average the right member of expression (60) over all « in [0, 26]", with 
« assumed to be uniformly distributed. That average is 


dx 1 Ps 

SX oa Blds(K, %) |X = (8) + ol. 
= (26) mu K; [ a( ) g ( ) ae «| 

If we note that the set {g‘(s)} are in one-to-one correspondence with 

the K3 members é; of &”, this quantity may be written as 


K%> 1 


Tn 5; Eld X, X xX = j di : 
KG to.231" (20) [ds¢ ) | é; + @] da 


which equals d; by equation (58). Since there must be at least one 
value of @ for which the right member of expression (60) is as small 
as the average, we have proved inequality (59). 

The first inequality of expression (23) follows from inequality (59) 
on noting as in part (z) that Q(7', A, 6) is a decreasing function of 6. 


3.3 Proof of Theorems 4 and 5 


Since Theorem 5 includes Theorem 4 as a special case we need only 
give a proof of Theorem 5. Our task is further simplified since the basic 
idea of the proof of Theorem 5 is the same as in Theorem 2 (Section 3.2). 
Here too we break the proof into two parts. In part (7) we assume that 
we are given an encoder-decoder for the digital source and deduce the 
existence of an encoder-decoder for the general source (which plays the 
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part of the analog source in Theorem 2). In part (iz) we do the opposite. 
However we do not have the complications here which necessitated an 
averaging argument in Section 3.2. 


(1) We prove here that G(T, 6) < P.[T, M¢(8)], the second inequality 
of expression (83). The proof parallels that of part (7) in Section 3.2. 
Instead of the analog source space X, we have here a general space &. 
The distortion is d;(x, £) with | — #| replaced by po(z, £). 


To transmit the source outputs which belong to X we use the system 
in Fig. 4. The digital encoder-decoder is for a Ky-ary source where 
Ky = M,(é). We assume that it attains a guaranteed distortion d, . 
The quantizer is defined as follows. Let {8;}3°7' be a minimum 6-cover- 
ing of X. For xe X, let q(x) be the smallest 7(0 S 7 S Ky — 1) such that 
xe S,,(6). Then if x = (4, , 22, °°: , %,) © LX” is the source output, the 
quantizer output iss = q’(x) = [¢(x1), q(t), --- , g(a,)]. The output 
of the digital decoder isS = (8, , 8,, --- , S,) and the converter output 
is X = (x; yote ae where ¥, = 8; when 8; = 7. Clearly, if S, = Se 
then p(X, , X;,) < 6. Thus for any source output x, 


ds(x) S dul” (1 S 


as 
so that the overall guaranteed distortion d; < dj, , from which part (7) 
follows. 


(i2) We prove here that P,[7, Mp(6)] < G(T, 4), the first inequality 
of expression (83). As in part (2), the proof of part (7) parallels that in 
Section 3.2. Again 2, is replaced by X and | x — #| by po(a, 4). 


As in Section 3.2, we assume that we are given an encoder-decoder 
for the general source with guaranteed distortion d; . We set Ky = M>p(8) 
and use the system of Fig. 6 to transmit the outputs of the K,-ary digital 
source. The modulator is defined as follows. Let {8;}*25' be a minimum 


6-packing of X. If source output iss = (8; , S2, °*- ,S,), then the modula- 
output is i (s) = (8,,,8.,,°°° » B,,). The output of the pay is 
= (Xi port yx n)> and the quantizer output isS = (S,,8.,---,8,), 


coe &, = Poe oe Se, (6). If X;, ¢ Sg,(6) for allt (0 S27 S Ky — 1), 
then §, = 0. 

Clearly, if po(X;, , X,) < 6, then S, = S, . Thus for any source output 
s, the conditional expectation 


du(s) S dlg(s)] Sd . 
< 


Thus the overall guaranteed distortion is dy 
of part (27) and the theorem. 


Aa 


d; , completing the proof 
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3.4 Proofs on Packing and Covering 


In this section we give a proof of Theorem 38, the main part of which 
is a lemma on covering of the K-ary n-cube. We also prove Lemma 1 
relating packing and covering in Section 2.4. 


3.4.1 Proof of Theorem 3 
We first establish the following lemma. 
Lemma 2: Let 0(0 < 6 < (K — 1)/K) be arbitrary, and let r satisfy 
R.,(0) <r < log K, 


where R,,(A) zs the equivalent rate for the K-ary source given by expressions 
(9). Using the terminology of Section 2.4, let % = {0,1, ---, K — 1}" 
(the K-ary n-cube) and p)(x, X) = di? (x, x). Then for n sufficiently large, 
there exists a 6-covering of X with M = e™ points. 


Proof: Let {x;}* be a set of K-ary n-vectors. Let F(x, , X2, +++ , Xa) 


be the number of members % of X such that d‘” (x; , %;) 2 6 for all 
7=1,2,---,M.IfF = 0, then {x,}}" is a 6-covering of Xx. We can write 


FR, °°: Xa) = 22 Om » Xa), 
xe 


where 


if d(x, .x>= — Stig 
a re , if d(x, ,%) = 0, all i= 1,2,---,M, 
0 otherwise. 
Now consider an experiment in which M = e”™ n-vectors {X,}/" are 
chosen at random from & independently with identical (uniform) dis- 
tribution 


Pr {X, — x} > Kk™, 


Then F(X, , X., °°: , Xar) is random variable with expectation 
EF = >) F@(&,X, ,X.,--: ,Xw), 
XeX 


where, as indicated, #@is computed with X held fixed. Now for a given X, 


E0(%,X,, °°. , Xa) = Pr {@ = 1} 
M 


= Pr () {di (x, X;) 2 0;} 


t=] 


= [Pr {di(&, Xi) = 0})", 
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where the last equality follows from the independence and identical 
distribution of the random vectors {X;}. Letting a, = Yoosicon fe). 
(K — 1)'K~ be the probability that dy,(%, X;) < @n, we have 
E&(%, X,, °°: , Ka) = 1 —a,)”" S eo, 
independent of x. Thus 
EF S< Me*"", 


Now it is well known (see for example, Ref. 8, p. 178) that for 0 < 
6< (K — 1)/K,asn—- @, 


Gn, = € 
Thus since M = 2° andr > R.,(9), 


agn - 6 
E(F) < Me a®M = er exp {—e!" Reaq( ese — 0, as N— ©. 


—nReq(@) +0(n) 


Now, there must be at least one particular set {x;}/’ such that 
FR, ,X:, be > Xm) < EF, 


Thus if we choose 7 large enough so that E(F) < 1, F(x, -++ Xy) = 0 
(since F is an integer valued function). Thus {x;}}’ is the required 
covering. 

The proof of Theorem 3 now follows the standard proof of a source- 
channel coding theorem, with Lemma 2 playing the role of the source 
coding theorem. (See Ref. 2.) Roughly speaking the proof is as follows. 
When 7(K, ps, C) = (K — 1)/K, the entire theorem is trivial, since 
we can attain a guaranteed distortion of (K — 1)/K without even using 
the channel by simply letting the decoder outputs take the value 
20 S$ 7 S K — 1) with probability 1/K. Thus assume that 0 S$ y < 
(Kk — 1)/K. 

The channel can transmit e”? (where R < C, the channel capacity) 
in T seconds with arbitrarily high reliability (see Section 2.1). By the 
definition of y = 7(K, pg , C) [expression (10b)], 


Realy) S C/ps . (61) 

Let « > 0 be arbitrary. In Lemma 2, let r = C/ps — e: , where e, > 0 
will be chosen below. Then approximate the T-second source output 
(a Ky-ary n-vector, 2 = psT') by a (covering) set with e” = e7°’7 mem- 
bers. Since rps < C we can transmit these n-vectors through the channel 
with arbitrarily high reliability. Further, with « > 0 arbitrary, and if 


tS Regl¥ + ¢), (62) 
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we have from Lemma 2 that the error in making the approximation 


will always be less than or equal to (y + e) for T sufficiently large. In 
fact, if we set 


a= Rea) = Rady 5 ‘) > 0 


[since #(y) as defined in equation (9) is strictly decreasing for y < 
(kK — 1)/K], then [using inequality (61)] 


a ae = © _ p.4) + Ray + $) 
Ps Ps 


Rady + £) > Rely + ©) 


IV 


and condition (62) is satisfied. 
We conclude that for 7 sufficiently large, we can make 


dix =. =e 
for arbitrary « > 0. Thus 
limit P,(T, K) Sy te—y, as «0, 


T-0 
which is Theorem 3. 


3.4.2 Proof of Lemma 1 


We say that A C & is a ‘maximal A-packing”’ if A is a A-packing, 
and if for all v ¢g A, the union {v} U A is not a A-packing. We establish 
Lemma, 1 by showing that every maximal A-packing is a (27A)-cover- 
ing. Let A be a maximal A-packing. If A is not a (27A)-covering, then 
there exists a Vp e X such that po(vo, u) > 27A, for all wu e A. From con- 
dition (30b), v ¢ A. We claim that {v.} U A is a A-packing, con- 
tradicting the maximality of A. If w ¢ S,,(A), then for all ue A (using 
the definition of 7) 


Poo» U) S nlpo(Yo , w) + polw, u)], 
so that 


Polo td) 2nd 


pow, u) 2 = opaas W hey ee 


Thus w ¢ S,(A) and {v,} U A is a A-packing, establishing the lemma. 


APPENDIX 


List of Symbols 


oe 
P(x) 


\*(T, N) 


X 
d(x, €) 


dy (x, £) 
a" (x, %) 
d 

d*(T’) 
ds ay £) 
d(x) 

d 


d*(L) 


Q(T, A, 5) 
Q(T, 6, A) 


PT, K) 
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the source output space 

the source probability density function 
source output rate (symbols per second) 
(x; ¢ X) the 7th output of the source 

(Gi Xe, +** @,) 2 

set of “allowable” channel inputs 

the set of all channel outputs 

the coding delay 

ps I 

the encoding function, fz(x) ¢ Wr 

the decoding function, fp(z) 2 2” 

the decoded n-vector, ¥ = fp(z) e X” 
number of code words in a code 

1/T log N, the rate of a code 

the word probability of error 

smallest attainable word error probability for a code 
with parameters NV and T 

average probability of error 

the distortion function 


0, L= 2 
L; ge 


= 1/n De d(xx , €x) 
= Kd” (x, X) 


the smallest attainable d for a given delay T 
| 1 [a —-#| <6 


lo |a—#é|26 


the expectation of d”(x, X) given x 

SUPzexn A(X) 

the smallest attainable value of d for a given delay 7 
d*(T) for d3(x, £) and x e[—A/2, A/2] 

d*(T) for d;(x, £) and « e [—A/2, A/2] 

the minimum attainable per symbol error rate for an 
equiprobable K-ary memoryless source 


VK, ps ,C) = limron P.(T, K) 
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C 


the channel capacity 


P.(T, K) the minimum attainable guaranteed per symbol error rate 


for a K-ary source 


G(T, 54) generalization of Q, defined in Section 2.4 
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On Digital Communication Over a 
Discrete-Time Gaussian Channel 


with Noisy Feedback 


By AARON D. WYNER 
(Manuscript received April 21, 1969) 


We consider the problem of transmission of digital data over a discrete- 
time Gaussian channel with the use of a Gaussian feedback channel. We 
are particularly interested in the case where the signal-to-noise ratio in the 
feedback channel is finite. By making use of simple extension of P. Elias’ 
scheme for transmitting analog data over this channel with feedback, we 
show that it 1s possible at some transmission rates to increase the error- 
exponent (reliability) compared to the error-exponent found by C. E. 
Shannon for the one-way channel. In particular at transmission rate zero, 
we show that the error-exponent can be improved by a factor of 1 + 
[6/(1 + p)], where p and # are the forward and feedback signal-to-noise 
ratios respectively. 


I. INTRODUCTION 


We consider the problem of transmission of digital data over a dis- 
crete-time Gaussian channel with the use of a Gaussian feedback channel. 
We are particularly interested in the case where the signal-to-noise 
ratio in the feedback channel is finite. 

In Sections II and III we consider Elias’ scheme and a simple exten- 
sion for transmitting analog data over this channel with feedback.*’? 
In Section IV we apply this extended Elias scheme to the digital trans- 
mission problem. The main result is that for any rate R < R*, a number 
less than the channel capacity, it is possible to transmit digital data at 
a rate R with error probability 


P, = exp [—£*n, + o(n,)], as n> &, 
where n, is the encoding-decoding delay, and #* > E, , the “one-way” 


exponent estimated by Shannon.’ In particular, when R = 0, EF, = p/4 
and H* = (p/4)[1 + A(1 + p)~"], where p and @ are the forward and 
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feedback signal-to-noise ratios respectively. Finally, we suggest a 
modfication of this scheme which will probably permit extending R* to 
capacity. 

Stimulated by the work of Schalkwijk and Kailath, a great deal of 
research has been done on this problem (see for example Refs. 4—11). 
To the present author’s knowledge, however, the result in this paper 
is the first to show that a noisy feedback channel can improve the error- 
exponent for digital communication on a band-limited channel. (Ref- 
erences 4 and 8 treat the infinite band case.) Like the optimal coding 
schemes for the one-way channel, our scheme is not constructive. Let 
us remark here that this discrete-time channel is a model for the con- 
tinuous-time Gaussian channel with a bandwidth constraint. (See Ref. 
12 or 18.) 


II. STATEMENT OF ELIAS’ PROBLEM 


We define a Gaussian channel as follows. The input is a real number 
x and the output is anumber y = x + z, where the ‘‘noise”’ zis a Gaussian 
variate with mean zero and variance o° and is independent of x. We 
assume here that the channel input x is a random variable, and require 
that the expectation Ex’ < P, the “‘signal power’’. 

To begin with, let us suppose that we wish to transmit the value of a 
random variable 6 with the use of N transmissions over a Gaussian chan- 
nel (with parameters P and o”). Assume also that a feedback Gaussian 
channel (with parameters P and ¢é*) is available which we may use 
(N — 1) times alternating with the N forward uses. We assume nothing 
about the statistical nature of 6 except that the expectation Hé” = 0}. 
Our goal is to obtain an unbiased estimate 6 of 6 with minimum possible 
mean-squared error. Further, we restrict ourselves to linear processing 
of all data. We now state the problem and constraints precisely. 

The forward and feedback channels are memoryless Gaussian channels 
with signal power P and P respectively and noise power o® and é” 
respectively. Thus for the nth use of the forward channel the input is 
zx, and the output is y, = x, + z,, where Hx? = P and z, is a Gaussian 
variate (independent of z,) with mean zero and variance o. For the 
nth use of the feedback channel the input is ¢, and the output is J, = 
#, + 4,, where E42 = P and 4, is a Gaussian variate (independent of 
£,) with mean zero and variance 6”. We assume that the random varia- 
bles {6, z, , 2,} are independent. The condition requiring ‘‘linear process- 
ing’ means the following. The input 2, to the forward channel (at the 
nth use) is given by 
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n-1 
t= a9, tn = OnO+ DY dufe, nm = 2,8,°°-,N. (1) 


k=1 


The input to the feedback channel £, (at the nth use) is given by 
fn = >> CuYn mse 12,097, N — I. (2) 
k=l 


Tinally, the receiver’s estimate after N uses of the forward channel 
(and N — 1 uses of the feedback channel) is 


N 


We require that 6 be unbiased, that is, that given that @ = 6,, the 
conditional expectation of 6 is 


E(6|6 = 0.) = 6. (4) 
The mean squared-error, which we wish to minimize is 
7 = E(6 — 6)’. (5) 
Let yap, be the minimum attainable value of y* (over all choices of 
the coefficients a, , Daz Caz» Un). It is easy to show that 


(i) yp, depends on P and o” only through their ratio p & P/o’ 
(the forward “signal-to-noise” ratio), and on P and é* only through 
a A a2 
p = P/é’. 

(it) for a given N, p, and #, y}pz is proportional to a; . 

Thus we can write 


2 2.2 A 
YorTt = T€oPT(p, Pp; N), 


and our problem reduces to the determination of ¢$p7(p, 6, N) (which 
can be thought of as a noise-to-signal ratio). 

Let us observe that from the linearity assumptions (equation 1, 2, 
and 3) it follows that 


6 = a ae g, (6) 
where a is a constant and é is a Gaussian variate independent of 6. From 


equation (4) it follows that a = 1 and HE = O, and from equation (5) 
Ee’ = y’. Thus we can rewrite equation (6) as 


where & is a Gaussian variate (independent of 6) with mean zero and 
variance y*. The important point here is that the entire process may 
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be thought of as reducing the N uses of the forward channel (and the 
N — 1 uses of the feedback channel) to asingle one-way Gaussian channel 
with signal-to-noise ratio (6")/(E#’) = o3/y’. 


Ill. ELIAS’ RESULT 


Elias solved our problem for the special case N = 2, where two uses 
of the forward channel and one of the feedback channel are permitted.*"” 
In his solution Elias admits the possibility that for the two uses of the 
forward channel, the signal-to-noise ratios are p,; and p2 respectively, 
where p; is not necessarily equal to p.. His result is that the smallest 
attainable mean-squared error is given by 


A -1 
2 2 PiP2p 

= ale FPS Gp) + ps) + fl ) ss 
As discussed at the end of Section II, we can consider the entire process 
as a single one-way gaussian channel with signal-to-noise ratio 03/72 . 
We now turn to our problem, and note that we can obtain a (suboptimal) 
solution by applying Elias’ technique recursively. For N = 2 we can, by 
setting p, = p2 = p in equation (8), obtain a signal-to-noise ratio 
S. = {2p + p6/[(1 + p)? + Al}. For N = 3 we can, by setting p, = S, 
and p. = p, obtain a signal-to-noise ratio S; given by 


a ppSe 
Ss = [stot apes s|. 


and for arbitrary N we can obtain a signal-to-noise ratio Sy given by the 
recurrence 


— ppSn-1 
PONE OT a GN ep) ep (9a) 


with initial condition 
Si = p. (9b) 


Although equation (9) is difficult to solve explicitly we can obtain 
an approximate solution valid for large N. From equation (9a) 








Sya te S$ 8y S Svat et ayy: (10) 
so that 
oN < Sy < (, +5 7 -\W. (11) 
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We will show that as N — o, Sy is asymptotic to the right member of 
inequality (11). Let us rewrite equation (9a) 


Z Lt ete”. 12 
Sye1 Sy + ot (Aes |i +t bets ( ) 


Let Sy = [p + fo/(1 + p)|JN + oy, and expand the last term in equa- 
tion (12) into a power series in (1 + p + A)/[(1 + p)Sy]. We then obtain, 
after cancelling terms, 


pp _(d+pet+/ , 0+e+s)’ ea 
ete vt ts | (i+ pSy * A+ pee ae) 


From equation (11) we have that Sy = O(N), so that equation (13) 
becomes 








dva1 — by = —O(1/N), (14) 
and therefore 
dy = —O(log N). (15) 
Thus we conclude that 
Sy = E + il a bly - O(log N). (16) 


An exact solution for Sy, for various values of p, 6, and N is given in 
Table I. Sy’ provides an upper bound to «p-7 - 
Elias also found a lower bound to ep7 ; 


€opr = 1/[pN + ACN — 1)]. (17) 


This is the mean-squared error which results when the feedback chan- 
nel is reversed and used in the forward direction, and we are allowed 
to use the forward channel N times and the feedback channel (V — 1) 
times. Combining these results we have that 








[(o + AN] S corr S (e+ ate b\w ~ O(log wm) | (18) 


Let us remark here that the recurrence (9) can be solved exactly for 
the special case 6 = ©. In this case equation (9a) becomes 
Sy = Syl 0) (19) 


and the solution is 


= (heh p) Sek. (20) 
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TasBLeE I—Tue ExtTenpep Evias SCHEME 








FORWARD SNR 
ASYMP. 


a FEEDBACK SNR = ¢ 


pp z = BO 
fe, ASYMP. (0) (0) 





SNR = limy-. Sy Ls pet 





FORWARD SNR = 0.01 FEEDBACK SNR = 0.01 











ASYMP. SNR = 0.010099 ASYMP. E(O) = 2.52475H-03t 
N EQ. SNR = S, EQ. E(O) = Ey(0) CAPACITY = cy 
1 0.01 0.0025 4.97516E-03 
2 0.020001 2.50012E-03 4.95089E-03 
3 3 .00029E-02 2 .50024E-03 4.92693E-03 
4 4.00058E-02 2.50036E-03 4 .90328E-03 
5 5.00095E-02 2.50048E-03 4.87992E-03 
6 6 .00142E-02 2.50059E-03 4.85686E-03 
7 7 .00197E-02 2.50071 1-03 4.83408E-03 
8 8 .00262E-02 2.50082E-03 4 ,81158E-03 
9 9 .00334E-02 2.50093 E-03 4.78935E-03 

10 0.100042 2.50104E-03 4.76740E-03 

FORWARD SNR = 0.01 FEEDBACK SNR = 0.1 

ASYMP. SNR = 1.09901E-02 ASYMP. E(O) = 2.747521-03 
N EQ. SNR = Sy EQ. E() = Ey(0) CAPACITY =cy 

1 0.01 0.0025 4.975161-03 

2 2 .00089E-02 2.50112E-03 4 .95284E-03 

3 3 .00266E-02 2 .50222E-03 4.93078E-03 

4 0.040053 2.50331 E-03 4.90895E-03 

5 5.00878E-02 2.504391-03 4.88738E-03 

6 6 .01309E-02 2.50546E-03 4.86604E-03 

7 7 .01823E-02 2.50651 E-03 4.844931-03 

8 8 .02417E-02 2.50755E-03 4 .82405E-03 

9 9.03091 E-02 2.50859 H-03 4.80340E-03 
10 0.100384 2.50961E-03 4.78297E-03 
20 0.20154 2.51924E-03 4.590091-03 
30 0.303354 2.52795H-03 4.41569E-03 
40 0.40574 2.53587H-03 4 .257041-03 
50 0.508622 2.54311E-03 4.11197H-03 
100 1.02871 2.57178E-03 3.53701E-03 
150 1.55532 2.59219E-03 3 .12725E-03 
200 2.0861 2.60762E-03 2.81727E-03 
250 2.61979 2.61979E-03 2 .57283H-03 
300 3.1556 2 .62967H-03 2.37410E-03 
350 3.69305 2.63789 E-03 2 .20869E-03 
400 4.23179 2.64487 E-03 2 .06844E-03 
450 4.77156 2.65087H-03 1.94771 E-03 
500 5.31219 2 .65609E-03 1 .842481-03 








t The notation “3E-5’’ means 3 X 1075. 
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Taste I—(Continued) 





FORWARD SNR = 0.01 





ASYMP. SNR = 0.019901 

N EQ. SNR = Sy 
1 0.01 
2 2..00495E-02 
3 3 .01483E-02 
4 0.040296 
5 5 .04925E-02 
10 0.102197 

15 0.155082 

20 0.209115 

40 0.436077 

60 0.678846 

80 0.935508 

100 1.20434 

300 4.31176 

500 7.79234 

700 11.4295 

900 15.1502 


FORWARD SNR 


ASYMP. 


= 


SCOONOORWNHe 


—_ 


FORWARD SNR 


ASYMP. 


2 


COON OUoURWNE 


— 


SNR 
EQ. SNR = Sy 


.02439 
.05731 
.09453 
13433 
17584 
.21857 
. 26222 
.80658 
.8d15 


COON OORWHH 


roe 





1 
SNR = 1.5 


EQ. SNR = Sy 


i 


hoe 


2 
3.4973 
4.84722 
6.22905 
7.63202 
9.04989 

10.4788 

11.9162 

13.3603 


FEEDBACK SNR = 1 
E(0) = 4.97525E-03 


3179 











ASYMP. 

EQ. E(0) = Ey) CAPACITY = cy 
0.0025 4.97516E-03 
2 .50619E-03 4.96279I-03 
2.51235E-03 4.95045E-03 
2.51850E-03 4.93816E-03 
2 .52463E-03 4.92591 E-03 
2.55493 E-03 4 .86529E-03 
2 .58470E-03 4.80572E-03 
2 .613894E-03 4..74722K-03 
2 .72548E-03 4.52394E-03 
2 .82852E-03 4.31755E-03 
2 .92346E-03 4.12731E-03 
3 .01085E-03 3 .95214H-03 
38 .59313E-03 2.78320E-03 
3.89617E-03 2.17388E-03 
4.08196E-03 1.80005E-03 
4 .20840E-03 1.54552E-03 

FEEDBACK SNR = 0.1 
ASYMP. —E(0) = 0.2625 
EQ. E(0) = Ey) | CAPACITY = cy 

0.25 0.346574 
0.253049 0.276677 
0.254776 0.23342 

0.255908 0.203521 
0.256716 0.18139 

0.257327 0.164227 
0.257806 0.150457 
0.258194 0.139122 
0.258516 0.129599 
0.258788 0.121468 

FEEDBACK SNR = 1 
ASYMP. E(O) = 0.375 
EQ. E() = Ey) | CAPACITY = cy 

0.25 0.346574 
0.275 0.290788 
0.291441 0.250579 
0.302951 0.220746 
0.311453 0.197811 
0.318001 0.179623 
0.32321 0.164826 
0.827462 0.152531 
0.331005 0.142138 
0.334007 0.183223 
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Tasie J—(Continued) 








FORWARD SNR = 1 FEEDBACK SNR = 100 
ASYMP. SNR = 51 ASYMP. EO) = 12.75 
N EQ. SNR = Sy EQ. E(0) = Ey(0) | CAPACITY = cy 
J 1 0.25 0.346574 
2 2.96154 0.370192 0.344158 
3 6.70566 0.558805 0.340326 
4 13.5159 0.844743 0.334405 
5 24.9907 1.24954 0.325774 
6 42.434 1.76808 0.31427 
4 66.142 2.36222 0.300486 
8 95.3736 2.98042 0.285515 
9 128 .952 3.58201 0.270398 
10 165.782 4.14455 0.255834 
50 2073 .46 10.3673 7.63746E-02 
90 4079 .34 11.3315 4.61885E-02 
200 9646.14 12.0577 0.022936 
FORWARD SNR = 100 FEEDBACK SNR = 1 
ASYMP. SNR = 100.99 ASYMP. E(O) = 25.2475 
N EQ. SNR = Sy EQ. EF) = Ey) | CAPACITY = cy 
1 100 25 2.30756 
2 200.98 25.1225 1.82704 
3 301 .965 25.1638 0.95227 
4 402 .952 25.1845 0.750162 
5 503 .94 25.197 0.622444 
6 604.928 25.2053 0.533897 
7 705.916 25.2113 0.468637 
8 806.905 25.2158 0.418403 
9 907 .894 25.2193 0.378457 
10 1008.88 25.2221 0.345879 


Since the capacity, 3 log (1 + Sy), of the equivalent Gaussian channel 
(with signal-to-noise ratio Sy), cannot exceed N times the capacity, 
4+ log (1 + p), of a single channel (with signal-to-noise ratio p), Sy as 
given by equation (20) is in fact optimal. Thus 


corr(p, ©, N) = [(1 + p)* — 1)", (21) 


which is an exponential in NV. 


IV. APPLICATION TO DIGITAL COMMUNICATION 
4.1 Schalkwijk-Katlath Technique 


Suppose we wish to transmit one of M equally likely messages over 
a Gaussian forward channel with signal-to-noise ratio p with the aid of 
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TasLE I—(Continued) 


FORWARD SNR = 100 FEEDBACK SNR = 100 
ASYMP. SNR = 199.01 ASYMP. E(0) = 49.7525 
N EQ. SNR = Sy EQ. E(0) = Ey(0) | CAPACITY =cy 
1 100 25 2.30756 
2 297 .078 37.1347 1.42434 
3 495 .429 41.2858 1.03457 
4 694.043 43.3777 0.817997 
5 892.77 44.6385 0.679545 
6 1091.56 45.4816 0.583023 
7 1290.39 46.0853 0.511677 
8 1489 .25 46.539 0.456669 
9 1688 .12 46.8923 0.412887 
10 1887 .02 47.1754 0.377164 
FORWARD SNR = 100 FEEDBACK SNR = 1000 
ASYMP. SNR = 1090.1 ASYMP. E(O) = 272.525 
N EQ. SNR = Sy EQ. E() = Fy(0) | CAPACITY = cy 
I 100 25 2.30756 
2 1092.78 136.597 1.74935 
3 2173.1 181.091 1.28073 
4, 3258 .25 203 .641 1.01116 
5 4345 .00 217.2538 0.837702 
10 9786.78 244.669 0.459444 
15 15232 .7 253.878 0.321.042 
20 20680. 1 258 . 501 0.248424 
40 42474.8 265.467 0.183209 
60 64272 .6 267 .803 9 .22575E-02 
80 86071 .7 268.974 7.10184E-02 
100 107871. 269 .679 5.79435 E-02 
120 129672. 270.149 4..90532E-02 
140 151472. 270.486 4 .26006E-02 
160 173278. 270.738 3.76957 E-02 
180 195073. 270.935 3.38365E-02 
200 216874. 271.093 3.07177E-02 





a Gaussian feedback channel with signal-to-noise ratio 6, using the 
forward channel N times and the feedback channel NV — 1 times. Fol- 
lowing Schalkwijk and Kailath,’® we assign to message i (¢ = 
1,2, --- , M) the number 6 = 6; = 7 — (MV + 1)/2. Thus the M mes- 
sages are equally spaced on the interval [—(M — 1)/2, (M — 1)/2] at 
distance 1 apart. We can now apply the results of Sections III and IV 
to transmit 6. The expectation HL’ = a; is 


o = (M+1)WM—1)/12, M=1,2,3---. (22) 
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When message 7 is transmitted, the output of the system is 6 = 6; + &, 
where £ is a zero mean Gaussian random variable with variance y’. We 
select as the decoder output, that j(1 <= j < M) which minimizes 
| 6 — 0; |, so that we make an error only when | £| 2 4. This event has 
probability 


P, = 28(—1/2y), (23) 


where B(x) = 1/(27m)' f7,, exp (—2’/2) dx is the cumulative error 
function. Thus the smallest error probability attainable using this 
scheme (with parameters N, p, f) is 


(24) 


] 
P., or = 28| -s— | 
oie Zeopr(p, p, N)oe 


where gy is given by equation (22) and eopr in Section IJ. The bounds 
ON €$pz in Section III immediately yield bounds on P, opr. 

Let us assume that every 7’ seconds, a digital message source emits 
one of M = e”” equally likely messages (R is the message “‘rate’’). 
Further assume that N = aT (for example, if the “physical” channel 
has bandwidth W eps, then a = 2W). Consider two cases: fp = ©, fp < ©. 

(i) When #6 = ©, it follows immediately from equation (21) and 
(22) that as T > « 


1 (1 ae p)*" 2 (C-R)T QF 
2€opr(p, oe, N)as v3 a 7 v3e (25) 


where C = (a/2) log (1 + p) is the channel capacity in nats per second. 
Thus, provided R < C, as T — o the argument of © in equation (24) 
becomes infinite and P,.5p, — 0. In fact, (since ®(2) ~ (2rx”)™?* exp 
(—2°/2), ast — «) 


CcC-— 
Pe. opr = exp le” eee rave |e 


as JT] @, (26) 
a double exponential decay. This is the celebrated result of Schalkwijk 
and Kailath.’?’” 

(iz) If we try to apply the same scheme when the feedback signal- 
to-noise 6 < ©, then from equation (18) (2eopra4)' > O as T— o. 
Thus it is not possible using this scheme to obtain vanishingly small 
error probability as 7’ — o with fixed signal-to-noise ratios in the 
forward and feedback channel. This is so no matter how large 6 may 
be, provided it is finite. For finite 7’ however, equations (18) and (24) 
yield useful estimates of attainable error probabilities. 
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4.2 Improving the One-Way Error Exponent 


Suppose that, as in Section 4.1, we wish to transmit one of M = e”” 


equally likely messages in 7’ seconds. Suppose that we use only a for- 
ward Gaussian channel (with signal-to-noise ratio p) n, = aT times. 
Then it is well known that one can attain an error probability 


P, = exp l=a.@ : pla + a(n) | , a Too, (27) 


where ,(R/a, p) > 0, if R < a/2 log (1 + p) = C, the channel capacity. 
As indicated, the quantity L,(R/a, p) depends on R and a only through 
their ratio. Although Z, is not known exactly, estimates are given in 
Ref. 3. In particular, £,(0, p) = p/4 and E,(C/a, p) = 0. 

Now suppose we have a Gaussian feedback channel available with 
signal-to-noise ratio #. Let us divide the n, forward channel uses into 
vy = n,/N groups of N forward channel uses. In each of these groups we 
use the extended Elias scheme, (of Sections III and IV, with N uses of 
the forward channel and N — 1 uses of the feedback channel) to gen- 
erate an equivalent forward Gaussian channel with signal-to-noise ratio 
Sy given by the recurrence (9). We then use a one-way coding scheme 
with v = n,/N = (a/N)T uses of the equivalent forward channel. 
With N held fixed as T — o, we can attain an error probability as in 
equation (27) with a replaced by (a/N) and p replaced by Sy—namely, 


RN L 
P, = exp | -2, (28 Sx) ra + a(t) |: (28) 
Thus the new error-exponent is 
i RN 
Ey, Pp, p) _ + Bs ’ Sw): (29) 
a 


Since N is arbitrary, we can state our result: 


Theorem: Given a forward and feedback Gaussian channels which can 
each process a inputs (independently) per second, with signal-to-noise 
ratio p and p respectively. Then it is possible to transmit digital data at a 
rate R nats per second with error probability 


P, = exp [—L*aT + o(T)], as T— o, (30a) 


+The conventional power constraint for a one-way channel is that the time 
average of the square of the inputs must not exceed P. The power constraint used 
here is that the statistical expectation of the square of each input not ex- 
ceed P. Nither of these constraints imply the other. However, it is not hard to 
show that the estimates of Hi (in Ref. 3) are valid for both constraints. 
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where 
* = wo(E Dp, 6) = sup Ly = sup 2 n,(BR Sx) : (30D) 
1sN<o 1sN <0 
Sy is the solution to the recurrence (9), and E, ts the reliability (error- 
exponent) for the one-way Gaussian channel as in equation (27), and T 
as the encoding-decoding delay. 
Remarks: 


(t) Since S,; = pand H,(R/a, p) > OforR < a/2 log (1 + p) = C, 
then E£*(R/a, p, ~) > Ofor R < C. 
(27) Since E,(0, p) = p/4, 


Ey(0, P, p) = + 2,(2X ’ Sy) 


so that from equation (16), 


Sy 


reo 4N’ 








E*(0, p, p) = Be 8 (1 aor +. -) , as No, 
In fact, since Sy/N can be shown to be non-decreasing, H*(0, p, 6) is 
in fact equal to this quantity. Thus the use of the feedback channel 
represents an improvement of a factor of [1 + #/(1 + p)] in the error- 
exponent at zero rate. 

(cit) We can get a rough idea of the behavior of H*(R/a, p, p) as 
follows. Let r = R/a be the rate in nats per channel use. Let us crudely 
approximate the one-way exponent £,(r, p) as r varies from 0 to c = 
C/a (the capacity in nats per channel use) by a straight line connecting 
(r = 0, LE, = p/4) and (r = c, £, = 0). See Fig. 1. 

Then /, has r = 0 intercept at 


Se pp. +P 

2-4 8[(1 +p) + p]~ 4’ 

and E2(r, p, p) = Oatr = c = (1/2)(1/2) log (1 + S,). Similarly, Ly 
has r = 0 intercept at 


£00, p, p) = =f + 


a os ON. 2 ON. 
Ey, P, p) = 4N 7 4(N aa 1) ? 
and Ey(r, p, 6) = Oatr = cy = (1/2N) log (1 + Sy). From Fig. 1, 
we see that for each value of r > 0, there is a value of N(1 S N < ~) 


which maximizes E’'y(r, p, 6) to achieve E*(r, p, 6). Values of Ey(0, p, A) 
and cy are tabulated for various values of p, 6, and N in Table I. 
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Fig. 1— E*(r, p, 6) vs. r (an approximation). 


(zv) We see from Fig. 1, that the feedback scheme offers no improve- 
ment over the one-way scheme (that is, Hy < EH, , for N > 2) forr* S 
r < cwhere 7* is the solution of H, = E, , that is, 


3H, (2r*, S2) = E,(r*, p). 


However, the rate r* > casp—- o. 

Actually, it is probably possible to improve on our results substantially 
and in particular bring about an increase in the error-exponent for all 
r<c. Let {N,,N2,--- ,N,} be a set of positive integers (not neces- 
sarily equal). Then divide the n, = aT’ forward channel uses into vy = 
no/(N, + N. + --- N;,) uses of an equivalent channel which is the 
parallel combination of & Gaussian channels with signal-to-noise ratios 
Sy, , Sy, , °°: , Sy, . These & Gaussian channels are generated by 
Ni, Nz, -:- , N, iterations, respectively, of the Elias scheme. One 
must then compute the error-exponent for a parallel combination of 
channels to obtain a new improved exponent.’* We leave this task as an 
open problem. 

(v) Let us finally remark that although the expectation of the channel 
input power x” is constrained, the quantity x” is in fact a random vari- 
able distributed on the interval [0, ©). This is in contrast to the one-way 
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schemes where the channel input is bounded. This point is discussed 
in Ref. 13. 
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Mode Conversion Caused by Surface 
Imperfections of a Dielectric 


Slab Waveguide 


By DIETRICH MARCUSE 
(Manuscript received May 8, 1969) 


This paper contains a perturbation theory which is applicable to the 
scattering losses suffered by guided modes of a dielectric slab waveguide as a 
consequence of imperfections of the waveguide wall. The development of 
the theory occupies the bulk of the paper. Numerical results appear in 
Sections VI and VIII to which a reader less interested in the theory ts 
referred. 

The theory allows us to conclude that random deviations of the waveguide 
wall in the order of 1 percent, for guides designed to guide an optical wave 
of Ao = Ly wavelength, can cause scattering losses of 10 percent per centi- 
meter or 0.46 dB per centimeter. A systematic sinusoidal deviation of the 
waveguide wall can cause total exchange of energy from the lowest order to 
the first order guided mode in a distance of approximately 1 cm tf the ampli- 
tude of the sinusoidal deviation from perfect straightness is only 0.5 percent 
of the. thickness of the guide. An rms deviation of one of the waveguide walls 
of 9A causes a radiation loss of 10 dB per kilometer (index difference 
1 percent, guide width 2.5). 


I. INTRODUCTION 


The problem of how to transmit laser light over large distances or 
carry it short distances inside the laboratory has renewed the interest 
in dielectric waveguides.*~° Such waveguides usually used in the form 
of clad fibers or as strips of a medium of larger dielectric constant 
embedded in another diclectric medium are capable, in principle, of 
guiding electromagnetic radiation. By proper dimensioning, a dielectric 
waveguide can be made to transmit only one guided mode. In this 
respect mode guidance by dielectric waveguides resembles mode guid- 
ance by hollow metallic waveguides. Hollow metallic tubes can be 
constructed to allow only one mode to propagate so that mode conver- 
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sion (except for conversion to the reflected dominant mode) becomes 
impossible. Such truly single mode operation is impossible for dielectric 
waveguides since these guides can always lose electromagnetic energy 
to the continuous spectrum of unguided modes. 

The possible solutions of Maxwell’s equations for a dielectric wave- 
guide consist of a discrete spectrum of a finite number of guided modes 
plus a continuum of waveguide modes.° The guided modes have field 
configurations which concentrate the electromagnetic energy inside 
and in the immediate vicinity of the structure. The continuum of un- 
guided modes extends to infinite distances from the waveguide and 
consists of a superposition of incident and reflected waves. A convenient 
way of visualizing the physical significance of the continuum of unguided 
modes is as follows. If a plane wave is incident on the dielectric wave- 
guide at an arbitrary angle, part of it penetrates the dielectric structure 
while some portion is reflected. The resulting superposition field of 
incident and reflected waves satisfies Maxwell’s equations and the 
boundary conditions at the dielectric waveguide and as such can be 
viewed as a mode of the structure, but the energy of this mode is not 
concentrated near the waveguide and there are no specific restrictions 
on the projection of the propagation vector in the direction of the guide 
axis. 

A perfect dielectric waveguide can transmit any of its guided modes 
without converting energy to any of the other possible guided modes 
or to the continuous spectrum. But any imperfection of the guide, such 
as a local change of its index of refraction or a deviation from perfect 
straightness or an imperfection of the interface between two regions 
with different index of refraction, couples the particular guided mode to 
all other guided modes as well as to all the modes of the unguided con- 
tinuum. Imperfections of this type are unavoidable. They transfer 
energy from the desired guided mode to unwanted guided modes and 
the radiation field of the continuum of unguided modes, thus increasing 
the loss of the desired guided mode. 

This paper gives a simple, approximate theory of the losses of di- 
electric waveguides, caused by imperfections of the boundary between 
the inner region of higher dielectric index and the surrounding outer 
region of the dielectric waveguide. Even though the method of analysis 
used here can be used to describe any arbitrary dielectric waveguide, 
we limit the discussion to a simple case. We describe the effects of mode 
conversion for a dielectric slab surrounded by vacuum, assuming for 
simplicity, that there is no variation of the dimensions or properties of 
the rod as well as the field distribution in one co-ordinate direction. The 
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restriction of demanding 0/0, = O for one of the co-ordinates y is no 
limitation on the method of analysis but is imposed strictly for con- 
venience. It simplifies the analysis considerably without drastically 
changing the conclusions. The tolerance requirements based on our 
analysis are rather stringent. They show the order of magnitude of the 
losses which can be expected from deviations from perfect geometry. 
Additional variations in the direction considered perfect in this paper 
is unlikely to improve any of the loss predictions. 


II. TE MODES OF A DIELECTRIC SLAB 


Let us consider the transverse electric modes of the dielectric slab 
of Fig. 1. True to the simplifying assumption discussed in Section I, 
we assume 


= 0 (1) 


|e 


with y being the co-ordinate perpendicular to the x and 2z directions, but 
parallel to the slab. The only nonvanishing field components are L, , 
H,,and H,. : 

Leaving the z and time dependence 


pital pe) (2) 


understood, we obtain the following modes of the ideal structure as a 
solution of Maxwell’s equations satisfying the boundary conditions. 





Fig. 1— Geometry of a dielectric slab waveguide. 
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2.1 Even Guided Modes 


For even guided modes 





& = A cosxe for |x| Sd, (3a) 
& = A coskde™*” for x = d, (3b) 
2 0&, 
, a wm Oz 5) (4) 
2 06, 
z ayn Ox ? (5) 


The field component &, satisfies the wave equation 


as, , O°&, 
Or + be. + nok? Ge = 0. (6) 


The value of the index of refraction 7, is different inside and outside of 
the dielectric slab. For simplicity, we assume 


M =1 for |x|>d. (7) 
The other constants are related as follows 
k? = w°eotto , (8) 
x = (vk? — p°)}, (9) 
y= (° — By. (10) 


The propagation constant 8 is obtained as a solution of the eigenvalue 
equation 


tanxd = *. (11) 


K 
The mode amplitude A can be expressed in terms of the power P carried 
by the mode. 
P=3Ref (-eset)dr=£ [| 6, Par. (12) 
—0 Wp Jo 


P is the power per unit length (unit length in y-direction) flowing along 
the z-axis. We obtain for the amplitude coefficient 


esc OS py (13) 


pd +5 
Y 
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2.2 Even Modes of the Continuum 


The continuum of unguided modes of even symmetry is given by the 
equations: 


& = B® cosox for |x| Sd, (14a) 
& = OMe + Dei for 2 = d. (14b) 


The other field components follow again from equations (4) and (5) 
and &, is a solution of equation (6). The constants are related to each 
other by the equations 


o = (nk? — py, (15) 

p = (k’ — 6°). (16) 
The radial propagation constant p can assume all values from 0 to «. 
The continuous mode spectrum starts at 8 = k and continuous to 6 = 0 
at which point we have p = k. Larger values of p are obtained for 
imaginary values of 8 corresponding to modes of the continuum exhibit- 
ing a cutoff behavior. 


The boundary conditions do not lead to an eigenvalue equation for 
8 but they determine C‘” and D™ in relation to B’. 


co = BB e*( cos od + isin rd) ; (17) 


DSO (18) 


(the asterisk indicates the complex conjugate quantity). 
The normalization of the modes of the continuum involves a 6-func- 
tion. Instead of equation (12) we use 


P io — of) = | a(o)8h(o") de. (19) 
Wu Jo 
With this normalization we get 
Bo? de ae (20) 
7( cos od + x sin? od) 


2.3 Odd Guided Modes 


In a manner similar to that for obtaining the preceding equations we 
obtain the equations for the odd guided modes 


& =A sinwe for x Sd, (21a) 
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& = A sinkde?™?- for «2 d. (21b) 


Equations (4) through (10) apply to the odd modes unaltered. The 
eigenvalue equation is given by 


tan kd = ae ; (22) 
and the mode normalization is 
AO? = pes (23) 
Bd + : 


2.4 Odd Modes of the Continuum 


As in Section 2.8 we obtain the equations for the odd modes of the 
continuum 


& = B® sinex for |a«|<d, (24a) 
6 = Ce a DM er for: a 20d, (24b) 
Cs Be (sin od — if cos ad} , (25) 
D® = om, (26) 
por = enh (27) 


2 
x8 sin’ od + cos” od) 


All these modes are orthogonal to one another. The even modes are 
orthogonal to all the odd modes, the guided modes are orthogonal to 
all the modes of the continuum, and all guided modes as well as all 
modes of the continuum are orthogonal among each other. The or- 
thogonality of the modes of the continuum among each other was 
already expressed by equation (19). Labeling the discrete modes by 
indices and dropping the vector component label y we can express the 
orthogonality of the discrete modes by the equation 


— Bm f * 
P ban = ae ae S565 dt: (28) 


III. MODE COUPLING CAUSED BY IMPERFECTIONS 


We want to study the losses which the lowest order guided mode 
suffers because of imperfections of the waveguide wall. A dielectric 
waveguide with wall imperfections is shown in Fig. 2. 
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Fig. 2 — Dielectric slab waveguide with wall distortions. 


The waveguide with wall imperfections is mathematically described 
by a refractive index distribution 


n'(a, 2) = ne(a, z) + An’(z, 2). (29) 
The index distribution 
2 
n(x, 2) = . esd (30) 
1 |e|>d 


describes the ideal dielectric waveguide whose TE modes were given in 
the Section II. The additional term An’ describes how the guide deviates 
from its perfect shape. Consider a deviation shown in Fig. 3. The 
corresponding distribution An” is (n, = index of refraction of the di- 
electric material of the guide) 


of <d if d< f(z) 
x<f@ if d> fe 
nz—1 ad<a<fe) if d< f@) 
—~@—-1) f@<x<d if d>4@ 
, f > fe if d< fi) 
z>d if d> fe) 


An’? = (31) 
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Fig. 3 — Illustration of the wall distortion function f(z). 


The field distribution EF, of this waveguide is a solution of 








2 2 
Se + St OB + bn\H, = 0 o) 


with H, and H, given by equations (4) and (5). The modes of the perfect 
waveguide form a complete orthogonal set for all TE modes with no 
variation in the y-direction. It is, therefore, possible to express any 
field distribution on the waveguide with imperfect walls by the expansion 


B, = DO.) + X | olo,2)8) dp. (33) 


The first summation extends over all even and odd modes of the discrete 
spectrum of guided modes. The integral extends over all modes of the 
continuum, and the summation sign in front of the integral indicates 
summation over even and odd modes. The expansion coefficients C,, 
and g(p) are unknown functions of z. 

To obtain a coupled system of differential equations for the expansion 
coefficients we substitute equation (33) into equation (382). Multiplying 
the resulting equation by 

Bm ox 
2wp Em 
integrating over x from — o to +, and using the orthogonality rela- 
tions and the fact that &, and &(p) are the (discrete and continuous) 
modes of the perfect guide leads to 
Cn ie Cg: Ce 
re ee 278 mn aa F,,(2) (34) 
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with 
7 Buk? ie) 5 
F(z) = ~ DeopP & C,,(2) iZ S* An’ &, dx 
a | dog(o,2) [ &* An* &(p) ax |: (35) 
0 —o 
Similarly multiplying by 
B" ; 
Fis &*(p’) 
leads to 
d°9(p") _ 97,7 0g(p’) _ . 
Oe 276 ay G(p 2) (36) 
with 





he? ee) . 
G(p’, 2) = Se |= c,(e) | 8*(p’) An’ &, dx 


cae i : dpg(p, 2) [ ; 8*(p’) An’ &(p) ax (37) 


No n-label on the power term P is necessary since we assume that all 
the normal modes are normalized to the same amount of power. The 
actual power carried by each mode relative to the power of the other 
modes is given by the C,, coefficients. Solutions of equations (84) and (85) 
with appropriate initial conditions provide us with exact solutions of 
the imperfect waveguide. It is interesting to note that this method of 
solution does not require the consideration of boundary conditions. 

The normal modes &, and &(p) were assumed to have the time and 
z-dependence of equation (2); this means they represent waves traveling 
in the positive z-direction. However, the solutions of equations (84) and 
(36) introduce waves traveling in positive as well as negative z-direction. 
To see this, let us assume that An” = 0 so that F,,(z) = 0. The equation 





OC. cestee c. 

Oe" = 228, 02 => 0 (38) 
has the solution 

C,(2) = A -+ Be7*?n? (39) 


with constant A and B. The product of A with &, results in a wave 
traveling in the positive z-direction but the product of B exp (27,2) 
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with &, results in a wave traveling in the negative z-direction. So, even 
though we started out with waves traveling in the positive z-direction 
the expansion (33) contains partial waves traveling in positive as well 
as negative z-direction. 

For the purpose of obtaining perturbation solutions of equations 
(34) and (86), an integral form of these equations is more useful. Treat- 
ing equations (34) and (86) as inhomogeneous differential equations, 
we can immediately write the following integral equations 





_ 2iBmz 1 ° 2iBm(z-$) fi 
Cm = An + Bye? + 5 } [e LF,(¢) dt, (40) 


218! 2 1 : 2718! (z2- r 
a(o!,2) = Clo!) + Die? + x, fe — NGG, © ab. 
: 0 





(41) 


It is important to know which part of equations (40) and (41) is as- 
sociated with waves traveling in the positive or negative z-direction. 
Therefore, we introduce the notation. 


CoG (42) 
with 
+ 1 ‘ 
COR) = An = a5 [ P(g) dt, (43) 
C2@ = {B.. ih i i omg) aha (44) 


The superscript (+) indicates the coefficient which after substitution 
into equation (33) produces waves traveling in positive z-direction, 
while (—) indicates the part which produces waves traveling in negative 
z-direction. A similar notation and resulting equations is used for g(p’, 2); 
however, the corresponding equations are obvious and are therefore 
omitted. 

The constants A,,, B, , and so on, occurrng in equations (43), (44), 
and the corresponding equations for g(p’, 2) must be determined from 
initial conditions. We always assume that the lowest order guided mode 
is incident on the imperfect waveguide at z = 0. Using the subscript 0 
for this incident mode we get immediately from equation (43) 


Cc =0 for m0 at z=0 
or 
A, =0 for m#0O, (45) 
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but 
A o> 1 . (46) 


We imagine that at z = L the waveguide is connected to a perfect guide 
so that at that point there are no waves traveling in negative z-direction. 
This leads to the condition 


1 


eS = -238 mo 7 
Dip, Jp oe mls) ag (47) 


Ba = 
for all values of m. The power loss AP of the incident mode due to mode 
conversion is given by 

AP = < . 
p= LUPO P+ OPO [1 


+> ie [loo ZL) |? + | ge, 0) |7] do. (48) 


Equation (48) states that the total power lost by mode conversion from 
the incident mode escapes at z = ZL in spurious modes traveling in posi- 
tion z-direction and at z = 0 in spurious modes traveling in negative 
z-direction. The factor P is the normalized power factor of equations 
(12) and (19); it is the power incident in mode 0. Notice that because 
of equations (45) and (47) only the integral terms of equations (43) and 
(44) (taken from z = 0 to z = L) enter into equation (48). 

The integral equations (43) and (44) can only be solved approxi- 
mately. We perform first order perturbation theory by using C,,(0) 
instead of C,,(z) and g(p, 0) instead of g(p, z) in equations (35) and (87). 
Furthermore, we realize that C‘?(0) for all m is a quantity of first 
order and will therefore be neglected in equations (85) and (87). The 
same is true for C{* (0) with m + 0. In the spirit of first order perturba- 
tion theory we use therefore 


Cn = Som (49) 
and 


g(p) = 0 (50) 


in equations (35) and (87). 

The perturbation theory is feasible not only when n> — 1 <1 but 
also when n° — 1 is arbitrarily large but the geometrical deviation of 
the guide walls from perfect straightness is slight. In either case we 
obtain from equations (35) and (87) the simple approximations 
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BA 2 * 
F,,(2) = ~S5.. Dp (n; —~ 1) {f@) ima d] Eo(d, z) &k(d, 2) 
2QwpP 
> [h(z) = d| &o(—d, z) Ex(—d, z)} ’ (51) 


EG@ai= - 5 n? — 1) {[J@) — dl€*(o, d, 2) Bold, 2) 


= [h(z) + d| &*(p, —d, 2) &(—d, z)} . (52) 


The function f(z) describes the dielectric-air interface in the vicinity 
of x = d, while A(z) describes it near z = —d. We assumed that f(z) 
and h(z) depart so little from x = d and x = —d that the functions 
&(x, 2) could be replaced by &(=d, 2). 


IV. EVALUATION OF THE SPURIOUS MODE AMPLITUDES 


We begin the discussion of the consequences of our scattering theory 
by calculating the coefficients C“ and g‘”’. We obtain [from equations 
(48) and (51) with the help of equations (3a) and (13) for the even 
modes] the following 


C820) = Ge Gol — 1) pO a bem — Hn) (68) 
Bo\ (5 4 Bn) 
| (60 v B)(p v Ym 
The coefficients ¢,, and y,, are defined by 
L 
om =F f UO — deter de (54) 
and 
Yn = = fhe) + dle de, (55) 
0 


These are the Fourier coefficients of the functions f(z) — d and h(z) + d 
which are expanded in a domain 


OszEL. 


The amplitude of the mth even mode depends on the Fourier components 
of the wall function whose “spatial frequency” T is 


bo 


am, 
An 


The corresponding expression for the even modes of the continuous 


Pi = = Bo a Bn . (56) 
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spectrum is: 


Lk’ COS Kod Cos cd[y(B) — v6) 








aC ee 1 - (57 
9:°(0, L) = azar — D ; z. ; 67) 
Bod sa '/8 cos’ od + sin od 
with [8 = B(p) see equation (16)] 
L 
08) = = [ I@ — dete" ae, (58) 
er 

WO) =F [ @ + aererP* ae, (59) 

The et expressions for the odd modes are 
COS Kod Sin k,d (60) 


Cro (L) = EF tn pay [on + )(o.0-+ 2) (gn + Yn), 


ft Fy ta Gh COS Kod sin od[y(6) o ¥(6)] . 
| (642 + 2s) sin od ee 3 Cos” ed) | 


(61) 


The Fourier coefficients g and y are given by equations (54), (55), 
(58), and (59) except that 8, and 8 are now the propagation constants 
of the odd modes. 

The corresponding expressions for C‘ and g‘” are obtained by 
replacing 8,, with —8,, and 8 with —8 in equations (54), (55), (56), (58), 
and (59). 


(-) 


V. SINUSOIDAL WALL DEFLECTIONS 


As a specific example, let us assume that the wall imperfections have 
sinusoidal shape. Then 


f(z) — d = asin 62 (62) 
and 
h(z) +d = —asin (62 + a). (63) 


The phase factor @ allows us to consider either a waveguide whose width 
varies sinusoidally 


a = 0, (64) 


3200 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1969 


or one whose direction changes sinusoidally 


. a= Tt. (65) 
We obtain from equation (54) with 
6 = Bo — Bn (66) 
the Fourier component 
a 
Gn = 4 (67) 
and from equation (55) 
a ia 
Vn face, OF é . (68) 


A term of the order a/L < 1 was omitted in equations (67) and (68). 
It is apparent that only one spurious mode is excited by the sinusoidal 
wall deflection since condition (66) can be satisfied for only one value 
of.6,,. If condition (66) is not satisfied, ¢,, and y,, are of the order of 
a/L <« 1. The fractional power scattered into one spurious guided 
mode due to a sinusoidal wall irregularity is [from equations (48), (53), 
(67) and (68)] 


(48) Lak’ , 5 COS” Kod COS" km 


(sd ze es) (sd z Ba) 


m 
0 Yn 





452 2 a 
P 4 (n, — 1) COS" 5 (69) 


for even modes or [from equations (48) and (60)] 


(42) Dak 5s 
Og 





2 o. 2 
COS Kod sin’ x,,d sin? 2 (70) 


2 
(sc i. be) (1nd a ba) 
Yo Vm 


es 2 
= cm — 3) 


for odd modes. However only one even or one odd mode can be excited 
by one particular sinusoidal wall deviation since it is impossible to 
satisfy the ‘‘resonance’”’ condition (66) for more than one mode simul- 
taneously. 

If a = 0, that is if the width of the guide changes sinusoidally, only 
even modes can be excited while sinusoidal deviations from straightness 
(a = 7) couple the even fundamental mode only to odd spurious modes. 
It must also be noticed that for a long period length 


A= (71) 


ol? 
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equation (66) can be satisfied only for forward scattering modes. To 
couple to backward scattering modes, the period length D must be 
approximately equal to half the wavelength of the guided modes. 
The fact that only one spurious mode is coupled to the incident mode by 
sinusoidal wall imperfections (it can be shown that the coupling to the 
continuous mode spectrum is also weak if one guided mode can couple 
strongly) allows us to give a much better description of the coupling 
process. 

Since the mode amplitudes C,, can change only slowly in the distance 
of one wavelength we can neglect the second derivative of C,, in equa- 
tion (34). Labeling the incident mode 0 and the one coupled spurious 
mode 1 we can write the equation system (84) in the following form 





oC, 
ea => —KoC;, ) (72) 
oC 
at = KaiCo ; (73) 
with 
2 
ee ns (2 — 1) COS Kod COS Kd 2) ie 2 (74) 


i. a 31h exp (i 5 
0 1 
| (2.0 FE Be) aud =e as) 
The coupling coefficient xo, of equations (74) holds for coupling from 
an even mode 0 to an even mode 1. The case of coupling from an even 
mode 0 to an odd mode 1 can be treated similarly. In fact, except for 
an unimportant phase factor, we get it from (1/L)[(AP/P),,]* of equa- 
rion (70). In equation (72) we omitted a term with C4 on the right-hand 
side, and similarly a term with C, was omitted in equation (73). These 
terms would be multiplied by sinusoidally varying functions and would 
describe the local change of phase velocity as the guide dimensions vary. 
These terms give no contribution if we use an average over Cy and C, 
over the mechanical period length of equation (71). 

Assuming Cy) = 1, C, = 0 at z = 0 the equation system (72) and (738) 
has the solution 


Co = cos | ko; | 2, (75) 
Ren 3 . 

GC, = eres sin | Kon | 2. (76) 
Ko 


Total exchange of energy is possible between the two coupled modes. 
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The distance D over which all the energy is exchanged is given by 


D (77) 


aha ss 
2 | Koy | : 
Finally, we need the power loss to the modes of the continuous spectrum. 


From equations (48), (57), (61), (62), and (63) we obtain 


(4°) = ak* (n2 — 1) cO s° Kod 
; aeened 


P 
Bod + 22 
Yo 
ee cos’ od cos” ; sin’ od sin’ 5 








2 a 2 
a( cos od + x sin” od) a( sin od + i cos” vd) 


sin’ [9 — (8 — 61 ¥ 
To-@ ar 


The integration can be performed easily if one realizes that for large 
values of ZL only a very narrow region in the 8 range near B — By — 6 
contributes to the integral. We consider all functions in the integrand 
as constant in this very narrow range and take them out of the integral 
with the exception of 


(78) 


L|? 
leit [8 — (Bo — 8)] | 
6— (Bo — 8) J 
This remaining integral can easily be performed if we use equation (16) 
to obtain 


dp = —? ag. 
p 
Following this procedure yields 
(i) = Hoe = ya 
: Bod + = 
Yo 


= 
oo 2 + ee 
p cos od + o’ sin’ od ° p sin’ od + o° cos’ od 


* * a 
p sin’ odsin’ = 





p cos’ ad cos” 
: (79) 
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The parameters o and p follow from equations (15) and (16) with 
B = Bo — 6. (80) 


Equation (79) holds only for 8 < k; we get AP/P = O for 8 > k. The 
most interesting aspect of equation (79) is its linear dependence on L. 
The scattering loss due to the modes of the continuous spectrum acts 
like a true loss process. By contrast, the corresponding equation (69) 
for the loss to guided modes is proportional to L” because coupling to a 
guided mode does not result in loss of energy but results in energy ex- 
change between the two coupled modes. Energy loss to one of the guided 
modes is followed by energy gain when the energy exchange has reversed 
itself, 


VI. NUMERICAL EXAMPLES FOR SINUSOIDAL IMPERFECTIONS 


A few numerical examples resulting from equations (74) and (77) 
are listed in Table I. Two different values of the index of refraction n, 
have been assumed, and for each value of the index three different 
values of kd = 27(d/o) have been chosen so that one, two, or three 
guided modes can exist simultaneously. The mode with £, is the lowest 


TABLE I—NuMERICAL EXAMPLES FOR SINUSOIDAL IMPERFECTIONS 





aD 
Ng kd Bod Bid Bod a Remarks 
1.3 1.729 —_ —_— — Single mode 
operation 
1.8 2.495 1.916 — 6.98 0 — 1 coupling 
a = T 
1.5 SEN 
6.17 0 — 1 coupling 
a >= 
3.0 4.336 3.831 3.051 © |, ——_—_- 
5.52 0 — 2 coupling 
a=0 
8.0 8.041 — = — Single mode 
operation 
15.0 15.113 15.022 — 42.54 0 — 1 coupling 
a= 
1.01 |---| _ , ——_ |__| 


36.28 0 — 1 coupling 

a= TT 
23.0 23.199 23.112 23.002 ©. |_| ——_______ 
43 .69 0 — 2 coupling 

a =0 
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order even guided mode which is assumed to be incident on the wave- 
guide with sinusoidal wall imperfections. This mode couples to the first 
odd mode with 6; or the next even mode with 8, . The values for the 
normalized, dimensionless quantitity (aD)/d°’ [a = amplitude of the 
sinusoidal wall deviation according to equation (62) and (63), d = half 
width of the guide, and D = energy exchange length] have been ob- 
tained with the assumption that equation (66) is satisfied for the two 
modes which are coupled together. Coupling from mode 0 to mode 1 is 
considered only for the case of sinusoidal straightness deviations of the 
waveguide (a = 7) while coupling between even modes 0 to 2 is con- 
sidered only for sinusoidal changes of the thickness of the waveguide 
(a = 0). It is immediately apparent from Table I that the energy ex- 
change length D is shorter for a guide with larger values of the refractive 
index. 

To obtain a feeling for the numbers involved in this mode coupling 
phenomenon, let us assume that n, = 1.5 and that the free space wave- 
length is \) = lu. The value of kd = 1.8 corresponds to d = 0.286u. 
To achieve total exchange of energy between modes 0 and 1in D = 1em 
requires the extremely small amplitude a = 5.72 107° ora = 0.572 Alt 
The length of the mechanical period in this example is A = 3.1p. 

Next, let us assume that the index of refraction is n, = 1.01. Using 
again, Ay = lu, we obtain from kd = 15.0 the value d = 2.39y for the 
half width of the waveguide. Requiring again, D = 1 cm, we finda = 
243 A. 

We can look at this problem in a different way. It is unlikely that any 
optical waveguide has a strictly sinusoidal deviation from perfect 
straightness. In fact, the numbers just presented show that it would 
be impossible to produce such a waveguide intentionally. However, we 
have seen [equation (53)] that the mode conversion between two guided 
modes is produced by a Fourier component of the actual deviation func- 
tion. It is therefore not necessary to have a strictly sinusoidal straight- 
ness deviation. Any arbitrary deviation from straightness can be de- 
composed into a Fourier series and the Fourier component at the 
mechanical frequency which satisfies equation (66) is responsible for 
the coupling. In the more general case of arbitrary straightness devia- 
tions, there can be no complete exchange of energy between any two 
modes since power loss to other guided modes and the continuous 


+ A mechanical period of a fraction of an Angstrom is somewhat unphysical due 
to the granular nature of matter. However, this result can be restated to say that 


complete power conversion occurs in 0.1 mm if the amplitude is a = 57.2 A. 
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spectrum of modes compete with each other since all of them are coupled 
simultaneously. 

We can now ask the question: What amplitude of the mechanical 
straightness deviation is required to transfer 10 percent of power from 
mode 0 to mode 1 in a distance of LZ = 1 em? Again, we use the previous 
examples. From equation (76) [or directly from equations (53) and (77)] 
we obtain 


(te oe ee 
me 


For the first example we obtain with n, = 1.5, AP/P = 0.1, d = 0.286u, 
and aD/d’ = 6.98 the value a = 0.115 A.t This result shows that if 
the Fourier component of the mechanical straightness deviation with a 
period length of 3.1p is a = 0.12 A (measured over a distance of 1 cm) 
the power loss caused by mode conversion to the first odd mode is 
10 percent. 

Tor the second example, we use again n, = 1.01, AP/P = 0.1,d = 
2.39u, and aD/d? = 42.54 and obtain a = 48.8 A. The important 
Fourier component in this case has a period of A = 135yu. The power 
loss to the modes of the continuous spectrum caused by a sinusoidal 
change in thickness of the waveguide (which is very similar to its effect 
as a straightness deviation) can be calculated from equation (79) with 
a = 0. 

Let us consider only one case, n, = 1.01, kd = 15, A/d = 25. For 
these values we obtain from equation (79) 

a AP 

aL P 
Assuming again AP/P = 0.1 for a guide length LZ = 1 cm, we obtain 
with d = 2.39n 


7 
DP 


= 4.6 X 107’, 


a = 5.46 X 10772 = 546 A. 


This number can be compared to the value a = 48.8 A which gave 
10 percent loss by conversion to one guided mode. However, for a 
meaningful comparison, we must remember that all the Fourier com- 
ponents of a Fourier expansion of the guide imperfections scatter power 
into the modes of the continuous spectrum. The total loss would have 
to be obtained by integrating the scattering loss over the spectral dis- 


t Again it is more reasonable to restate this example to say that 10 percent loss 
occurs over a distance of L = 0.1 mm if a = 12 A, 


3206 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1969 


tribution of the Fourier components of the mechanical Fourier spectrum. 
Instead of doing this integration we use a different approach in Section 
VIL. 


VII. STATISTICAL TREATMENT OF WALL IMPERFECTIONS* 


Equation (48) gives the relative loss of a guided mode caused by 
a definite (deterministic) distortion of the boundary of a dielectric 
waveguide. A quantity that may be even more interesting is the average 
of equation (48) taken over an ensemble of statistically identical sys- 
tems. 

For simplicity, let us assume that one wall of the waveguide is perfect 
while the other is randomly distorted. If both walls are randomly dis- 
torted, with no correlation between the distortions on opposite walls 
the loss value doubles compared to the case of only one wall being dis- 
torted. If the distortions on opposite sides of the waveguide are per- 
fectly correlated the amount of loss is at most increased four times. 
So to simplify the discussion we assume 


h(z) +d =0. (81) 
In order to be able to calculate (AP/P),, , we must evaluate 
1 ft, f® | | 
(| Pm I ae am =; | az [ dz’R( — ge fe baens ) (82) 
0 0 


We assumed that the correlation function 
Rt — 2’) = (fe) — dif’) — d))av (83) 


depends only on the difference between the coordinates z and z’ but not 
on their individual values. 
A change of integration variables allows us to write 


(Ge Pee = = i: (L — u)R(u) cos (Bo — Bn)u du. (84) 


To obtain equation (84) we made use of the fact that R(u) is an even 
function. 

The particular form of R(u) depends on the statistics of the wall 
imperfections. However, all correlation functions have two features in 
common. They all have their maximum value at u = 0 and decrease to 
zero as u— ©. If R(u) would not become 0 as u — © there would be a 


+ An excellent statistical treatment of random coupling effects in metallic wave- 
guides can be found in Ref. 7. 
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systematic distortion of the waveguide boundary instead of the assumed 
random behavior. To get an idea of what one might expect, we assume 
the following form for the correlation function 


R(u) = A’ exp (—ul). (85) 


A is the rms deviation of the wall from perfect straightness and B is 
the correlation length. Using equation (85) we obtain from equation (84) 


2_ i 
92 A? 1 if (Bo = Bm) oo RB? | 


(len Paw = + 
* (8. — Ba)? + ii? 1(( — 64)" + 4) 


where we neglected terms with exp (—L/B) assuming that L/B is 
sufficiently large. In fact if 








(86) 





L>B, (87) 
equation (86) can be simplified further: 
he. AP if 
(len Par = Dr (88) 


(Bo — Bx)? + 3 


Using equation (88) we obtain, from equation (53) for the ensemble 
average of the square magnitudes of the even guided modes, 


A*k*L 





(1 One Pv = AEE oat — 1" 
coe Kod COS” nm ; (89) 
(( = Ba) = 3:)(e.a + Bs)(a,a = Bs) 


The corresponding expression for the odd modes is very similar except 
that cos” x,,d is replaced by sin’ «,,d and B,, km, and y, are the param- 
eters of the odd modes. 

The total loss caused by coupling to all guided modes supported by 
the dielectric waveguide is the sum over all (| C,, |*),, for even as well 
as odd modes traveling in positive (6, = +|8, |) as well as negative 
(8m = —| Bm |) direction. It is noteworthy that equation (89) is pro- 
portional to L and not to L”. The conversion to spurious guided modes 
by random imperfections appears as a true loss to the incident mode. 

The losses due to the modes of the continuous spectrum are obtained 
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from equations (48), (57), (61), (81) and (88) (with 8, = 8): 


AP AVL (y _4y a p COS” Kod 
).- ; 


(6 — 8)° + Fa) (aa + ©) 








cos’ od sin’ od 

(- cos’ od + o’ sin’ d v p sin’ cd + o” cos” 4) | dB. (90) 
The relation between 8, o, and p is given by equations (15) and (16) 
while Bo, kK), and yo are related by equations (9) and (10) and their 
value is obtained by solving equation (11). The integral in equation (90) 
is extended over 8 from —k to k, the range of real values of the propaga- 
tion constant (in z-direction) of the modes of the continuous spectrum. 
Equation (90) thus includes the losses due to forward as well as back- 
ward scattered radiation. The radiation modes with imaginary values 
of 8 can carry power away from the waveguide only strictly perpendicu- 
lar to its axis. This power loss, if any, is not included in equation (90). 


VIII. NUMERICAL RESULTS FOR THE STATISTICAL CASE 


Figures 4 through 9 show numerical evaluations of equations (89) 
and (90). These figures can be grouped into two classes. Figures 4 
through 6 are drawn for a dielectric waveguide whose index of refraction 
isn, = 1.01. Figures 7 through 9 apply to a waveguide with n, = 1.5. 
Within each of these two classes, the kd value was chosen to allow for 
three different cases. Figures 4 and 7 apply to waveguides which can 
support only the lowest order guided mode. In this case there is power 
lost only to the modes of the continuous spectrum. Figures 5 and 8 
apply to waveguides supporting two guided modes and Figs. 6 and 9 
apply to waveguides supporting three guided modes. Each figure shows 
the normalized loss caused by scattering into modes of the continuous 
spectrum as solid lines and the loss to the possible guided modes as 
dotted lines. Also shown are the ratios of backward to forward scattered 
power as solid lines for the modes of the continuum and as dotted lined 
for the guided modes. The total power lost to the lowest order guided 
modes is the sum of the losses to the continuum and the spurious guided 
modes. 

Several remarkable features of these loss curves are worthy of a com- 
ment. The losses caused by the modes of the continuum as well as by 
the guided modes peak at certain values of the correlation length B. 
The location of these peaks are different, however, for the continuum 
and guided modes. 
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Fig. 4— Normalized radiation loss (d3/A2Z) (AP/P) and ratio of backward to 
forward scattered power AP~/AP* as functions of the normalized correlation 
length B/d for ny = 101 and kd = 8.0. Single guided mode operation (d = half 
width of waveguide, A = rms deviation of one waveguide wall, LZ = Length of 
waveguide section, ny = index of refraction of waveguide, k = free space 
propagation constant). 


The losses to the guided modes increase with increasing number of 
guided modes supported by the waveguide. However, the losses caused 
by the continuum of modes also increase as an increasing number of 
guided modes can be supported. This increase is less rapid, however, 
as one might expect because of the dependence of equation (90) on the 
fourth power of k. The fourth power dependence on frequency (or inverse 
wavelength) is typical for Rayleigh scattering by small particles, and 
it is not surprising that we encounter it here. 

Finally, it is apparent from the curves showing the ratio of back- 
scattered to forward scattered power that forward scattering is pre- 
dominant for large values of the correlation length. The ratio of 
AP ~/AP* levels off for large values of B. In some of the curves the 
leveling of the AP~/AP* curves occurs out of the diagram but it is a 
common feature of all the curves. For small values of the correlation 
length there is as much scattering in the forward as in the backward 
direction. 

For many practical applications, a waveguide supporting only one 
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Fig. 5— Normalized power loss and ratio of foreward to backward scattered 
power for radiation (solid curves) and spurious guided modes (dashed curves). 
Two guided modes (ny = 1.01, kd = 15). 
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Fig. 6— Similar to Fig. 5. Three guided modes (nz = 1.01, kd = 28). 
— — — — two guided mode loss; ————— continuum loss. 
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Fig. 7 — Similar to Fig. 4. One guided mode (n, = 1.5, kd = 13). 
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Fig. 8 — Similar to Fig. 5. Two guided modes (n, = 1.5, kd = 18). 
— — — — one guided mode loss; ————— continuum loss. 
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Fig. 9— Similar to Fig. 5. Three guided modes (ny = 1.5, kd = 3). 
— — — — two guided mode loss; continuum loss. 


guided mode may be of most interest. Let us assume Ay = 1p. Figure 4, 
holding for kd = 8.0 and n, = 1.01, applies to a waveguide whose half 
width is d = 1.27y. Taking the worst possible case of B/d = 9 or B = 
11.4y, we find from Fig. 4 

d®* AP _ is 
We 6 X 10°. 
If we want to know how much rms deviation A of one wall of the guide 
would be required to cause a 10 percent loss (AP/P = 0.1) in one centi- 
meter of waveguide (Z = 1 em) we find A = 5.85 X 10°’ = 585 A. 
The ratio of A over d gives an idea of the relative tolerance require- 
ments: 





of L 

77 
If the waveguide were to conform to the conditions of Fig. 6, we would 
have for \) = 1p a half width d = 3.66u. The losses caused by the two 
spurious modes are of the same order of magnitude as the radiation losses 
caused by the continuous spectrum. For B/d = 10 or B = 36.6p we get 
4 total loss of 


4.6 X 10° = 4.6%. 
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@ AP 

A’L P 

To cause AP/P = 0.1 for Z = 1 cm requires that 


=34x 107. 


A=12X10%y or | = 3.28%. 
The relative tolerance requirements are, therefore, approximately the 
same in both examples. 
As a last example let us use Fig. 9 corresponding (A) = 1p) to a wave- 
guide with n, = 1.5 and a half width d = 0.477yu. For B/d = 1.3 or 
B = 6.2u we find for the total loss 


a AP», = 
iL pe 2.3 X 10™. 
We get AP/P = 0.1 with L = 1 cm for 


A =2.18 X107y = 21.8A or = = 0.457%. 


| ps 


The perturbation theory, strictly speaking, holds only for small values 
of AP/P. However, it is reasonable to expect that the power scattered 
into the radiation modes escapes sufficiently rapidly so that no appreci- 
able amount of power reconversion from the radiation field to the guided 
mode occurs. The incremental power loss, AP/P = —alL, is therefore 
the same for any section of the guide so that we obtain the total scatter- 
ing loss into the continuum of radiation modes P = Pye~*”. We may 
now ask how much rms deviation is required to cause a radiation loss 
of 10 dB/km or a = 2.3 km™ = 2.3 X 107° em™. Using B/d = 10, 
corresponding to the top of the loss curve of Fig. 4, we obtain the 
equation 


3 
& x 23x 107° =6 x 10° 


so that (A = lp, n, = 1.01, kd = 8.0,d = 1.27 X 10°* cm) 


‘ = 6.98 X10 or A = 8.86 X 107° om = 8.86A. 
This figure dramatizes the stringent tolerance requirements of dielectric 
waveguides for long distance optical communications. In fact, such 
tolerances seem impossible to obtain. One can only hope that the cor- 
relation length can be kept far from the worst possible value of B/d = 10 
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(in this example) so that these extremely stringent tolerance require- 
ments might be eased. 


IX. CONCLUSION 


We have analyzed the losses suffered by the lowest order symmetric 
mode propagating on a dielectric slab waveguide caused by imperfec- 
tions of the waveguide boundaries. The analysis was simplified by assum- 
ing that there is no change in either the dielectric slab or the guided and 
unguided fields in one direction parallel to the slab. This assumption 
causes all our conclusions to be optimistic since variation of the slab 
in this direction can only cause additional losses. However, we expect 
that the results of this analysis give at least the correct order of magni- 
tude of the actual scattering losses. 

The statistical analysis was limited to a study of the effects which 
an exponential correlation function might have on the waveguide losses. 
The actual form of the correlation function may be quite different from 
this assumed exponential shape.t Conclusions regarding loss predictions 
are further hampered by a lack of knowledge of the expected correlation 
length. 

However, our analysis does lead one to conclude that scattering losses 
suffered by optical fibers or other dielectric waveguide structures may 
be very serious. Deviations of the waveguide wall in the order of a few 
percent can cause a power loss of 10 percent or 0.46 dB/cm if the wall 
imperfection can be described by an exponential correlation function 
with a correlation length to guide half width ratio of approximately 
B/d = 10. An rms deviation of A = 9 A causes a radiation loss of 10 
dB/km if the free space wavelength is \) = 1p and the guide has an 
index of refraction of n, = 1.01 (with vacuum on the outside). The 
width of the slab in this last example is 2d = 2.54. 

The mode coupling and radiation loss theory has been experimentally 
confirmed at microwave frequencies. A report on these measurements 
is given in Ref. 8. 


t Several other correlation functions have been tried and it was found that the 
results are insensitive to the particular choice of the function for values of B/d 
less than the value corresponding to the loss peak. In particular, the maximum 
loss value and the position of this loss peak were the same for different correla- 
tion functions. However, the loss values for B/d larger than the value correspond- 
ing to the maximum of the curve are very strongly dependent on the choice of 
the correlation function. 


MODE CONVERSION IN SLAB WAVEGUIDE 3215 


REFERENCES 


ioe) “NI for) up ce Nore 


. Kapany, N. S., “Fiber Optics,” New York: Academic Press, 1967. 
. Jones, A. L., “Coupling of Optical Fibers and Scattering in Fibers,” J. Opt. 


Soc., 68, No. 3 (March 1965), pp. 261-271. 


. Marcatili, E. A. J., unpublished work. 
. Miller, S. E., unpublished work. 
. McKenna, J., “The Excitation of Planar Dielectric Waveguides at p-n Junc- 


tions, I,” BS.TJ., 46, No. 7 (September 1967), pp. 1491-1526. 


. Collin, R. E., “Field Theory of Guided Waves,” New York: McGraw-Hill, 


1960. 


. Rowe, H. E., and Warters, W. D., “Transmission in Multimode Waveguide 


with Random Imperfections,” BS.TJ., 41, No.3 (May 1962), pp. 1031-1170. 


. Marcuse, D., and Derosier, R. M., “Mode Conversion caused by Diameter 


Charges of a Round Dielectric Waveguide,” B.S.TJ., this issue, pp. 3217- 
3232, 


Mode Conversion Caused by Diameter 
Changes of a Round Dielectric Waveguide 


By DIETRICH MARCUSE and RICHARD M. DEROSIER 
(Manuscript received July 8, 1969) 


This paper presents the theory of mode conversion and radiation losses 
of the lowest order circular electric mode in a dielectric rod (fiber) wave- 
guide and its confirmation by a microwave experiment. The theoretical re- 
sults were obtained from a theory whose detailed development has been 
presented in an earlier paper. 

The microwave experiment was carried out at approximately 50 GHz. 
The optical fiber with imperfect walls was simulated by a teflon rod of 1 cm 
diameter and 1 m length with a periodically corrugated wall. 

Mode conversion was observed in excellent agreement with theory. The 
observed radiation losses are somewhat less than the prediction of the 
perturbation theory, but the agreement ts quite good. The direction and 
width of the far-field radiation pattern was observed in agreement with 
theory. 


I. INTRODUCTION 


A theory of mode conversion and radiation losses of a guided mode in 
a dielectric slab was described in Ref. 1. The power conversion to spuri- 
ous guided modes as well as to the continuum of unguided radiation 
modes was assumed to be caused by deviations from perfect straightness 
of the air-dielectric interface of the slab. The model of the dielectric 
slab waveguide was chosen for its simplicity. 

Even though the dielectric slab exhibits all the relevant features of 
mode conversion caused by surface roughness and allows one to draw 
conclusions as to the order of magnitude of the losses suffered by guided 
modes in dielectric waveguides of other geometries, it is desirable to 
report the calculations for a round dielectric rod. The results of cal- 
culations for the dielectric rod are directly applicable to light trans- 
mission along optical fibers. Furthermore, we wanted to test the pre- 
dictions of the theory at microwave frequencies where a controlled 
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experiment, to check the effect of surface imperfections on mode guid- 
ance, is feasible. We present in this paper the theoretical treatment of 
the round dielectric waveguide with wall imperfections and its con- 
firmation by a microwave experiment. 

The mode conversion theory of round dielectric waveguides is only 
sketched in this paper since the basic method of calculation has already 
been described elsewhere. The theory is simplified by limiting the 
discussion to circular electric modes. In order to avoid coupling between 
the circular symmetric and other modes, we assume that the symmetry 
of the rod is such that all derivatives with respect to the angle ¢ of a 
cylindrical polar coordinate system (r, ¢, 2) vanish (0/d¢ = 0). 

We conclude again (as in Ref. 1) that the radiation and mode con- 
version losses caused by deviation of the waveguide walls from perfect 
straightness are extremely severe, imposing strict tolerance require- 
ments on the fabrication of low loss optical fiber transmission lines. 

To confirm the basic aspects of our theory we conducted a microwave 
experiment. Because of the ready availability of equipment, the fre- 
quency range of 50 GHz was chosen. Two teflon rods were used to simu- 
late optical fibers. Both rods had 1 cm diameters and a length of 1 m. 
One rod was smooth and was used for calibration and reference purposes, 
while the other rod was machined with periodic grooves to simulate an 
optical fiber with wall imperfections (ig. 1). 

The periodic wall perturbations cause two guided modes to be coupled 
together. In fact, it is possible to obtain complete power conversion 
between these two coupled modes. We have observed complete power 
conversion in agreement with our theory. 

In a certain frequency interval, the periodic grooves cause coupling 
to the continuous spectrum of radiation modes of the dielectric rod. 
The measured results are somewhat lower than the theoretical pre- 
diction. The reason for this discrepancy can be partly explained by a 


SMOOTH DIELECTRIC ROD 


CORRUGATED DIELECTRIC ROD 


1.93 cm 2b h 


Fig. 1— The smooth and corrugated teflon rods used for the microwave ex- 
periment (n,? = 2.05). 
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certain ambiguity in the value of the effective radius of the corrugated 
rod. If we make the assumption that the effective radius is either the 
largest or smallest radius of our rod, we obtain two curves which bracket 
our experimental results. However, our experimental values are consis- 
tently lower than the theoretical predictions based on an average diam- 
eter which is the arithmetic mean of the largest and smallest rod 
diameter. It is more likely, therefore, that the loss prediction of the 
perturbation theory is slightly too large for losses which are as high as 
those which occurred in our experiment. 

Our theory also predicts the far-field radiation pattern caused by a 
strictly periodic wall perturbation.” We have observed the peak of the 
far-field radiation lobe and its width in agreement with theory. 


Il. TE MODES OF THE DIELECTRIC ROD® 


Imposing the condition 


o] 
the transverse electric field is composed of the components 
Ho Ay ¢dles (2) 


The guided modes have the following form (normal modes of the perfect 
waveguide are indicated by script letters) 





S_ = AnJdilearye’ "for r<a (3a) 
J n lyse t(wt-Bnz 
6, S.A, AG HY (Gy we’? for r> a. (3b) 


The two magnetic field components are obtained from the #, component 


_ i abs 


= ee (4a) 
710 
x, = ane aE rH ,). (4b) 


The various symbols used in these equations have the meanings: 


a = radius of the dielectric rod, 

8, = propagation constant of mode n, 

kn = (nzk? — p2)', (5) 
Yn = (62 — k’)}, (6) 
k = 21r/d>) = free space propagation constant, 
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n, = index of refraction of the waveguide (rod), 
w = radian frequency, 
J, = Bessel function of order 1, and 


H;” = Hankel function of first kind and order 1. 


The boundary conditions, requiring that the field components &, and 
3C, are continuous at r = a, lead to the eigenvalue equation for B 


Yn J 1(kn@) = —74 Hy” (ipa) : (7) 
Kn J o(kn@) H3" (ty,0) 


The subscript 0 designates the Bessel and Hankel functions of zero 
order. It is convenient to express the mode amplitude A, by the actual 
power carried by each mode: 


i) Qa o 
P,=-3f drf{ dere,set = rl ele. fdr. (8) 
2 Jo 0 Wh Jo 


The modes will be normalized to the same amount of power (1 watt, for 
example) so that we write 


P, = P. (9) 
The mode amplitude can now be expressed as 
Quy P 


Aa = ar aa 
(1 + ) | Jo(kn@) J 2(kna) | 


(10) 


The modes of the continuous spectrum are given by the expressions 
& = Bien? r<a (11a) 
& = [CJi(pr) + DN, (pr) Je?!” r> a. (11b) 


The two magnetic components are again obtained from equations (4a) 
and (4b). N, is the Neumann function of order 1 and the parameters 
o and p are defined: 


o = (n2k? — B)', =p = (kh — 8’). (12) 


The normalization of the continuous modes involves the Dirac 6- 
function 


Palo — p') =e & [BER ar. (13) 
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The boundary conditions at r = a determine the relations between the 
constants C’, D, and B 


C 


B E pa(v(e0)No(oa) ~ © So(oa)Ns(oa)) (14a) 
B= —F pal vi(20)do(va) — £ Saloa)si(ee)) (14b) 


and these coefficients can be expressed in terms of the power carried 
by the mode 


Pega py. (15) 
POpL 


The actual field of a dielectric rod with imperfect walls can be expanded 
in terms of the normal modes of the perfect rod: 


B, = DO C.8 + [ 90800) do. (16) 


The remaining caluclation of the power loss to radiation and guided 
modes, as well as the energy exchange phenomena between different 
guided modes, are exactly analogous to those developed in Ref. 1 so 
that their derivation need not be repeated here. In Section III we simply 
quote the results of the corresponding calculations. 


Ili. SINUSOIDAL WALL PERTURBATION 


It was pointed out in Ref. 1 that a sinusoidal wall perturbation can 
couple only those two modes whose beat wavelength 


Qr 
= 17 
An Bo = Bn ( ) 


coincides with the mechanical period h of the wall perturbation. It is 
therefore possible to consider the coupling phenomenon between only 
two modes with the result that the coefficient C, of the incident mode 
and the coefficient C, of one of the spurious modes obey the relations 


Co(z) = cos | ko | 2 (18a) 


Ke 3 
Co = (sé) sin | ko. | 2 (18b) 


Ko 





with 
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(nt — 1) £2 (ka)? 
or 2a(Bo8;)? 
, J1(koa) J 1(k1@) : 
2 2 4 
(1 + \(1 + #2) Fut) Flea) Ilse db) | 
Yo v1 
Here, Ay is the amplitude of the sinusoidal wall deflection 
r(z) = a— A,gsin * 
= By — Bi 
The microwave experiment was conducted with a teflon rod with 
meandering grooves cut into it. The depth of the grooves is given by 


2b as shown in Fig. 1. The amplitude of the fundamental Fourier com- 
ponent of the periodic wall deflection of Fig. 1 is given by 


(19) 


(20) 


Ao Ss (21) 
Tv 
The two modes exchange their power completely over a distance 


D= (22) 


Sa cs 
2 | Ko1 |° 
The radiation loss of the dielectric waveguide of Fig. 1 can be calculated 
by the methods of Ref. 1 resulting in the following equation. 


1 tei 1)*(2) Gay 
a 





BE 

nS 1 Boa 
eee LC eee Ji(ona) 
(+ Di nennea Moms of (G) +B) 


(23) 


AP is the power lost to radiation modes on a section of the waveguide 
of length Z, and P is the power of the incident lowest order circular 
electric mode. The meaning of a and b is explained in Fig. 1. The sum 
in equation (23) takes account of the contributions of each component 
of the Fourier expansion of the distorted wall profile. The Fourier 
amplitudes of the function shown in Fig. 2 are 


Ab 
fae (2m + 1) 


{the zero component of this expansion appeared already in equation 


(24) 
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Fig. 2— The wall distortion function with Fourier expansion: 
ice) 


4b é 20 
2» Qm + 1x sin {((2m + 1) z4 


r= 


(21)]. The index m, which has been added to the coefficients B, C’, and 
D appearing in equations (14a and b) indicates that they must be 
evaluated for the following values of 


Bu = By — 2m + 1) =2, (25a) 
Om = (ngk° — Bn)? (25b) 
pn = (k* — 68) (250) 


The physical reason for the occurence of these discrete values of the 
propagation constant 6 in the continuous spectrum of modes is the 
requirement (derived in Ref. 1) that only those values of 8 are ap- 
preciably coupled to the incident guided mode which satisfy the relation 


Bo - B= 7 (26) 


where A,, is the period length of a Fourier component of the wall dis- 
tortion function. 


IV. THE STATISTICAL CASE 


To first order of perturbation theory, the expansion coefficient g(p, 2) 
appearing in equation (16) is given by 
k'ns — 1)(0)! 
Pe) ew Denar rs wore ams 
Her?) (2)%:(606)* 
gd (koa) J 1(ca) 


TOO Disease f 


(27) 
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with 
1 f” 
=+ [ (2) — alen 1" * de, (28) 


It was pointed out in Ref. 1 that the average power loss caused by scat- 
tering into the radiation field is given by 


(22) =f" doh Bas, (29) 


The symbol ¢ ),, indicates an ensemble average. The ensemble average 
of | o |’ is given by 


2 2 PP 
(Le Paw = | RQ) cos (Bo — Bn)u du (30) 
0 
with the correlation function 


R(u) = (f@) — aif + u) — a))ev- (31) 


The relative power loss caused by radiation from the rod is obtained 
from equations (27) and (29) 


Pm a =D? __Filcoa) * Le Poa] Ji(ca) 

iAP «) 3) Teantar (cy, (By He ee) 
B B 

V. NUMERICAL RESULTS FOR THE STATISTICAL CASE 


To be able to make numerical predictions, let us assume that the 
correlation function is given by 





Oo ee (—1el) (33) 
so that we obtain 
‘ BAe 
LK ¢ Py = -——+—_.. (34) 
(Bo re 6)” + B? 


Figure 3 shows a plot of (a°/LA’)(AP/P) as a function of B/a for 
n, = 1.01, ka = 23.0 and n, = 1.5, ka = 3.0. Both conditions are chosen 
so that only the lowest order circular electric mode can propagate in 
the dielectric rod. 

To get a feeling for the magnitude of the losses to be expected from 
random variations of the rod’s radius, we calculate the rms deviation 
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Fig. 3— Normalized radiation loss caused by ee wall perturbations with 
exponential correlation function, @ = radius of fiber, A = rms value of wall 
deviation, L = length of waveguide section, k = free space propagation con- 
stant. The dimensions shown in the figure were chosen to ensure single guided 
mode operation. 


A required to cause AP/P = 0.1 for a rod length of Z = 1 em for 

= 1.01 and the worst possible value of B/a = 2. Assuming \ = ly 
we get from ka = 23 the value a = 3.66y for the guide radius. With 
(from Fig. 3) 


3 
a 


_— 


27, = 0.16 


Bie 


we find 


= 1.5 xX 10° = 1.5% 


& |X 


or 
A = 550A. 
As discussed in Ref. 1, it may be permissible to apply the perturbation 


calculation of the radiation loss repeatedly so that from 


P = Pet (36) 
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can be obtained. We can then ask for the rms deviation A of the rod’s 
radius which causes a loss of 10 dB/km. With the numerical values 
used above we find 


A =84 A. 


Almost the same figure was obtained for the rms deviation of the half 
width of the dielectric slab which causes a 10 dB/km radiation loss of 
the lowest order (even) guided mode. However, in the case of the slab, 
one wall was assumed to be perfect. 


VI. THE MICROWAVE EXPERIMENT 


The experimental setup is shown in Fig. 4. The microwave signal is 
generated by a reflex klystron whose rectangular waveguide output is 
fed into a round waveguide by means of a rectangular-to-round wave- 
guide transducer. The round waveguide is connected to a section of 
round helix waveguide which serves as a mode filter suppressing all 
but the circular electric TE;” mode. Transition between the TE‘” 
mode of the round waveguide and the corresponding TE,; mode of the 
dielectric rod waveguide is achieved by inserting the rod into the wave- 
guide. This mode launcher is not perfect since a small amount of TEo2 
mode of the dielectric waveguide is excited. The TE‘ mode of the 
round waveguide cannot excite the pure TE»; mode of the dielectric rod 
since the field configurations of the two modes are slightly different. 
In addition to some residual TE,2. mode, small amounts of asymmetric 
modes of the dielectric rod are also excited because of imperfect center- 
ing of the rod inside the round waveguide. 

To probe the field outside of the dielectric rod and detect the con- 
version of power from the TE, to the TE. mode, we used a probe which 
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Fig. 4— Block diagram of the microwave experiment. 
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Fig. 5— Buildup of the TE. mode along the corrugated rod. Groove depth 
= 76 < 10°73 cm. 


consisted simply of an L-shaped piece of RG 98U waveguide which was 
mounted on an optical rail, which made it possible to move the detector 
parallel to the dielectric rod. The receiver attached to the L-shaped 
probe consisted of a single diode detector followed by an amplifier which 
was tuned to 250 Hz. The klystron was amplitude modulated at that 
same frequency. The periodicity of the grooves of the corrugated 
dielectric rod (Fig. 1) was chosen equal to the beat wavelength between 
the TE), and TEo2 modes of the dielectric rod as given by equation (17). 

Mode conversion from TE,, to the TE: mode can easily be observed 
with our detector arrangement because the TE 2. mode extends much 
farther away from the rod than the more tightly confined TE), mode. 
Moving the detector to approximately 4 mm from the surface of the 
rod made it impossible to observe any trace of the TE,,; mode, while 
the TE 2 mode could easily be detected. 

That the corrugation does indeed serve to transfer power from the 
TE»: to the TEo2 mode is shown in Fig. 5. The measured values of 
TEo2 power are shown as dots on this figure. Also shown is a plot of the 
sin’ x function which gives the theoretical law of the power increase 
according to equation (18b). The slight scatter of the measured points 
is caused by interference between the TE,. mode and some other residual 
mode which is unintentionally generated by the mode launcher. From 
equation (22) we calculate D = 80cm for our particular experiment. 
From Fig. 5 we see that the experimental value of the total energy ex- 
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change length is approximately 75 cm. The remaining discrepancy be- 
tween the theoretical and experimental values can easily be attributed 
to the machining accuracy of the rod which was no better than 2.5 X 
10° cm. Striking proof of the identity of the mode whose buildup is 
shown in Fig. 5 is provided by Fig. 6. 

Figure 6 was obtained by moving the L-shaped detector transversely 
at the end of either the smooth or the corrugated rod. The detector is 
thus probing the near field radiation pattern which results as the guided 
mode leaves the end of the rod and radiates into space. This near field 
radiation pattern is a faithful reproduction of the shape of the guided 
mode inside of the waveguide. The solid curve shown in Fig. 6 was 
obtained by probing the transverse field pattern of the smooth rod. 
This field pattern shows clearly the TE», mode. There is a slight dis- 
tortion in the wings of this mode which is caused by interference between 
the TE); mode and a small amount of TEo2 power launched by the trans- 
ducer. The dotted curve in Fig. 6 was obtained by placing the detector 
at the end of the corrugated rod. We took care to insert the corrugated 
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Fig. 6— The near-field radiation patterns of the guided modes (transverse 
field distbiton). Solid line = TE mode at end of smooth rod; dotted line = 
TEve mode at end of corrugated rod. 
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Me 7— Buildup of TE. mode along the corrugated rod. Groove depth = 23 
x 10-2 cm. 


rod so far into the launcher that the section protruding from the launcher 
was equal to the total power exchange length shown in Fig. 5. It is 
apparent that the TE). mode (instead of the TE); mode generated by 
the launcher) is present at the output end of the corrugated rod. It is 
also apparent that almost complete mode conversion has taken place. 
Figures 5 and 6 were obtained from a corrugated rod whose grooves 
had a depth of 7.6 X 107° em. In order to be able to observe radiation 
losses, we deepened the grooves in this rod to a depth of 2.3 X 107° 
The power buildup as a result of mode conversion from TE), to TE: 
on the rod with deeper grooves is shown in Fig. 7. The TE,, mode is 
shown to go through two complete power exchanges. The exchange 
length 1s now 25 cm in agreement with theory. 

Finally, we observed the radiation of power from the corrugated rod 
with the deeper grooves. Equation (26) indicates the relation between 
the z-component of the propagation vector of those radiation modes 
that couple to the TE,,; mode and the period of the periodic corrugation 
of the rod. It is clear that the basic Fourier component with length 
A, of the corrugated wall distortion function will contribute predomi- 
nantly to radiation loss. Furthermore, since 8 < k is required for all 
radiation modes, we see that only very little power can be lost to radia- 
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tion unless the relation 


B—-kst (37) 
is satisfied. It follows from equation (37) that above f = 51GHz very 
little radiation loss is to be expected. Indeed we see in Fig. 7 that com- 
plete energy exchange between two guided modes is taking place which 
would be impossible if substantial amounts of power had been lost to 
radiation. However, below 51 GHz, equation (23) predicts considerable 
radiation loss. 

The applicability of the radiation loss theory to our experiment is 
somewhat questionable. We must not forget that equation (23) was 
derived from a perturbation theory under the assumption that only 
very little power is lost from the original guided mode. If the radiation 
detaches itself from the rod over a distance for which the power loss 
of the guided mode due to radiation is only slight, we may be justified 
in making the transition to equation (86). However, this procedure 
becomes more and more questionable as the radiation losses increase. 
I'urthermore, the transition to equation (86) is less likely to be ac- 
curate if the radiation is directed forward along the rod. It is shown in 
Ref. 2 that forward radiation results close to the region where the equal 
sign of equation (87) applies. 

Finally, there is some uncertainty what value ‘a’ for the rod’s 
radius should be used in equation (23). Since the radius of the cor- 
rugated rod is variable, some suitable average value must be taken. 
Figure 8 shows three theoretical curves. The two dotted curves were 
calculated using the largest and smallest value of the radius in equation 
(23). The solid curve was obtained by using the average value of the 
radius. The crosses in Fig. 8 show the results of our loss measurements. 
It is apparent that most of these points fall within the two dotted curves. 
However, all points lie below the solid curve. These loss measurements 
were obtained by comparing the output power at the end of the smooth 
and corrugated rod. The accuracy of these measurements is no better 
than approximately +3 dB. In view of the discussion of the applica- 
bility of the perturbation theory to high radiation losses, the agreement 
between theory and experiment must be considered as good. 

Figure 9 shows the angle of the far-field pattern of the radiation lobes 
caused by power loss due to the corrugated wall. The dots are measured 
values, while the curve is a result of the theory of Ref. 2. Again we see 
good agreement between experiment and theory. 


MODE CONVERSION 3231 








Loss IN dB 




















48 50 
FREQUENCY IN GHz 


Fig. 8— Radiation loss as a function of frequency. Dotted lines represent 
theoretical loss assuming that guide radius is either the maximum or the mini- 
mum value. Solid curve shows theoretical loss based on average radius of cor- 
rugated rod. The dots are the measured points. 
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Fig. 9— The angle of the far-field radiation lobe as a function of frequency. 
Solid-line represent theory; the dots are measured points. 


3232 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1969 


VII. CONCLUSION 


This paper contains a perturbation theory of mode conversion effects 
and radiation losses of a round dielectric waveguide. This theory is 
applicable to light transmission in optical fibers. The theory developed 
here is limited to the circular electric modes of round dielectric wave- 
guides. However, the order of magnitude of the losses for other modes 
is expected to be similar. 

The theory has been checked by a scaled experiment at microwave 
frequencies. The dielectric fiber with wall imperfections was simulated 
by a teflon rod of 1 cm diameter which was provided with periodic 
grooves. Mode conversion from the TE, mode of the dielectric rod to 
the TE,y2 mode was observed in excellent agreement with experiment. 
The observed radiation losses are in reasonable agreement with theory. 
An existing discrepancy can be attributed to the limitations of the 
perturbation theory to predict correctly the high losses encountered 
in this experiment. 

The conclusion to be drawn from our theory for the operation of 
optical fibers is a need for very strict tolerance requirements. For ex- 
ample, the radiation losses caused by surface roughness of a fiber 
designed for single mode operation at 1y wavelength can be as high as 
10 dB/km for an rms variation of the fiber wall of as little as 8 A. 
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Radiation Losses of Dielectric Waveguides 
in Terms of the Power Spectrum 
of the Wall Distortion Function 


By DIETRICH MARCUSE 
(Manuscript received July 23, 1969) 


In an earlier paper I described a perturbation theory of the radiation 
losses of a dielectric slab waveguide. The statistical treatment of the radiation 
losses was based on the correlation function of the wall distortion. This 
paper discusses the results of the radiation loss theory in terms of the power 
spectrum of the function describing the thickness of the slab. We found that 
only those mechanical frequencies 0 of the power spectrum contribute to the 
radiation loss that fall into the range By — k < 6 < By +k. (6 = prop- 
agation constant of guided mode, k = free space propagation constant.) 
The mechanical frequencies near both end points of this mechanical fre- 
quency range contribute more to the radiation loss than the region well 
inside of this range. 

We also discuss the far-field radiation pattern caused by a strictly 
sinusoidal wall distortion. 


I. INTRODUCTION 


In an earlier paper I developed a perturbation theory of the mode 
conversion effects between guided modes and of the radiation losses of 
a given guided mode caused by deviations from perfect straightness of 
the waveguide wall.’ For simplicity, the discussion had been limited to 
a waveguide in the form of an infinitely extended dielectric slab. 

The statistical discussion had been based on the description of the 
wall distortion by means of a correlation function. In Ref. 1 an exponen- 
tial correlation function had been assumed. However, it has been estab- 
lished that the shape of the correlation function has little influence on 
the radiation losses. 

It is possible to base the discussion of radiation losses not on correla- 
tion functions, but on the mechanical power spectrum of the wall dis- 
tortion function. This study provides information as to how the various 
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mechanical frequencies of the wall distortion function contribute to the 
radiation losses. 

The analysis of Ref. 1 was based on the use of radiation modes of the 
dielectric slab which represent standing waves in directions transverse 
to the propagation direction of the guided modes. The question naturally 
arises how a superposition of these standing waves can result in radiation 
flowing away from the rod. This question is answered by examining 
the far field radiation pattern caused by a sinusoidal distortion of one 
wall of the dielectric waveguide. This paper gives the relation between 
the length of the mechanical period, the wavelength of the guided mode, 
and the direction of the main lobe of the radiation. 


II. RADIATION LOSS AND POWER SPECTRUM 


The amplitudes of the modes of the continuous spectrum were derived 
in Ref. 1, equations (65) and (69). We have 


p(COS ko d cos od)[y(0) — ¥(6)] 
a+ &) (p’ cos’ o d + o’ sin” -)| 


(1) 


1 


g.(o, B) = save  — 1) 
[5(s 


for the even modes, and 
p(COs ko d sin o d)[y(6) + ¥(6)] 


3 
od + Bs) Go sin? ¢ d + o° cos” <a) | 


su(0, L) = a 
B\B 


(2) 
for the odd modes. The functions 
1 f” ae 
o) =z Y@ — de" ae, 3) 
WO =z | (he) + de" ae, (a) 
with 
6=6,-8 (5) 


are the Fourier transforms of the wall distortion functions f(z) — d and 
h(z) + d. [x = f(z) is the boundary of the dielectric-air interface, x = d 
describes the wall of the perfect guide, and « = A(z) is the distorted 
boundary near x = —d.] 
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The meaning of the constants appearing in equations (1) to (5) is: 


Bo = propagation constant of guided mode (propagating in z-direc- 
tion), 

8 = component of the propagation constant of the continuum mode 
in z-direction, 


k = propagation constant in free space, 
L = length of guide section with wall distortions, 
n, = dielectric constant of slab, 
p = (k° — py? (6) 
o = (nk? — p°)}, (7) 
= (n2k* — B2)}, (8) 
0 = (85 — k’)}. (9) 


The y-component of the electric radiation field caused by the wall 
distortions is given by 


ss is [g-(p, L)&.(p, 2) + gol, L)&o(p, 2)] dp. (10) 


The functions &, and &, are the even and odd radiation modes. The ratio 
of scattered power to incident guided mode power is obtained from 


AP : 
= | (alo, 2) |? + | goto, L) 2 ag. (11) 
-k p 
For simplicity we assume that one wall of the slab is perfect 
h(z) = —d, (12) 
so that 
¥(0) = 0, (13) 
the relative scattering loss, follows from equations (1), (2), and (10) 
k 
1 
> = | Bho F1@) a8 (14a) 
with 
_ (kd) 2 42 Cos” sor nt E Te cos od 
Bes Mie) aud + , °) | Ged)? cos” od + (od) sin” od 
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sin’ od 
T (pay? sin” od + (cd) cos” a) a) 
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Since ¢(6) is the Fourier component of the wall distortion function its 
absolute square value 


| (6) |’ (15) 


is the “power spectrum”’ of f(z) — d. It is apparent from equation (14) 
that AP/P depends on the power spectrum of the wall distortion func- 
tion. Incidentally, equation (14) is not a statistical expression, but holds 
for a specific dielectric slab waveguide. We entered the power spectrum 
in the combination L | ¢ |’ in equation (14) since this combination is 
independent of L for a randomly varying function f(z) — d. 

Equation (14) allows us immediately to determine the range of 
mechanical frequencies @ which contribute to the radiation loss. The 
integral in equation (14) is extended from —k to k, the 8 range of 
continuous radiation modes. The range of mechanical frequencies con- 
tributing to the scattering loss is therefore given by 


By -k<60< 6B +k. (16) 


This is an important result since it states that those parts of the power 
spectrum which lie outside of the range, equation (16), do not contrib- 
ute to radiation loss. 

This last statement must not be misconstrued to mean that a wave- 
guide with a sinusoidal wall distortion extending over length L 


f(z) =d+asin ’z OszsL (17) 


with 6’ lying outside the range of equation (16) does not lose power by 
radiation. The power spectrum of equation (17) is 
2 
és sin (@’ — @) é 
2 = eee so . 


A term with 6’ -++ 6 in the denominator has been neglected in equation 
(18). The accuracy of this approximation improves with increasing 
values of L. 

It is apparent from equation (18) that | ¢(6) |? has non-vanishing 
values for 6 # 6’ so that there is some small contribution to radiation 
loss even if 6’ lies outside of the range of equation (16). 

However, if we consider the limit Z — © we can approximate the 
power spectrum, equation (18), by a 6-function: 


lim | o(@) |? = 57 8(0 — 6”). (19) 
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In this special case the expression (14a) for the scattered power becomes 


PF -3G) mo ‘“ 


The scattering from a dielectric waveguide with a wall distortion func- 
tion whose power spectrum is a 6-function is proportional to I(8, — 6’). 

The function J(@) is plotted in Fig. 1 for n, = 1.01, kd = 8.0, and 
Bod = 8.041. The scattering caused by a wall distortion with a 6- 
function spectrum (a sinusoidal wall distortion of infinite length) is 
nearly independent of the value of 8 = Bo — 6’ over most of the B-range. 
There are two sharp peaks at 8 © k and 8 } —k. The physical reasons 
for the sharp increase in loss at these values is easy to understand if 
we consider the direction of the radiation pattern as a function of 6’. 
We show in Section III [equation (85)] that the angle a between the 
waveguide and the main radiation lobe is given by 





cosa = 6 = Po mS (21) 


The two peaks of the function J(8), or correspondingly of the radiation 
loss, are associated with 
aXe0 and af& do. (22) 


This shows that the radiation loss is high when the radiation pattern is 


x1073 














Fig. Peg yiaaay representation of the function /(g) [eq. (14b)]. n, = 1.01, 
kd = 8.0, Bod = = 8.041. 
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directed very nearly parallel to the surface of the waveguide. The 
radiation modes gain more power if the guided mode can interact with 
them over a longer distance. An observation of this loss peak is re- 
ported in Ref. 2. 

A power spectrum with sharp peaks much like that of equation (18) 
or (19) is not likely to occur for dielectric waveguides with random 
imperfections of the dielectric interface. It is much more reasonable to 
expect that such waveguides may have spectral distributions which are 
nearly independent of 6 over a certain range of 6 values. In the limit 
of a “white” spectrum, 


| o(6) |? = constant, (23) 


the scattering loss is proportional to the integral over the function I(8) 
shown in Fig. 1. The two peaks contribute very little to this integral. 
Numerical integration of I(@) of Fig. 1 including and excluding the peaks 
resulted in the values: 


i 1(8) d8 = 0.011, i ~ 1(8) d8 = 0.0096, 


7.5 
aad [ _ 1B) a8 = 0.0087, 


This result is reassuring for the use of the perturbation theory which 
was used to derive equation (14). The perturbation theory is based 
on the assumption that power is converted from the guided mode to 
the radiation field but that no power is converted back from the radiation 
field to the guided mode. This approximation is certain to yield better 
results if the radiation pattern is directed away from the rod. In other 
words, the perturbation theory will work poorest in the region of the 
peaks of Fig. 1. However, for spectra that do not particularly favor the 
regions of these peaks, the contribution of those regions (which at the 
same time give the least reliable results) to the total radiation loss is 
only slight. 


III. THE FAR FIELD RADIATION PATTERN 


The far field pattern of the radiation field (that is excited by the 
lowest order even guided mode traveling in the dielectric slab with 
sinusoidal perturbation of one wall) can easily be calculated from equa- 
tion (10). The even and odd radiation modes were given in Ref. 1 (for 


|x| > d) 


WAVEGUIDE RADIATION LOSSES 3239 








ete) a |=; 2wuP | 
J mB(p’ cos’ cd + a’ sin’ od) 


X [p cos p(\z| — d) cos od — osin p(|x| — d) sin cd]e’‘*’”” (24) 


a? = peer 2 | errr Oe | 
|x| LaB(p" sin” od + oa” cos’ od) | 
X [p cos p(|z| — d) sin od + osin p(|x| — d) cosadje'“’'*”. — (25) 
With (6) = 0 and 
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and with the a of equations (1) and (2) we get from equation (10) 
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sin od[p cos p(x — d) sin od + osin p(x — d) cos cdl} 
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eee: 
oe yz) sm (0’ — @) 5 aia 
X exp | 7(6’ — 86) i Speeye xe dp. (27) 


In the far field with z — © and z— o (but L finite) we can obtain an 
approximate solution of the integral in equation (27) by the method of 
stationary phase.* The sine and cosine functions of argument p(x — d) 
can be expressed as sums of exponential functions. The most important 
terms of the integrand of equation (27) are, therefore, of the form 


exp [—7(6z + px)]. (28) 


This exponential term is an extremely rapidly varying function of p as 
x— © and z— o. All other terms in the integrand vary slowly by com- 
parison. According to the method of stationary phase the contribution 
to the integral comes predominantly from a region that is determined by 


2 (pices) E20: (29) 
p 
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With the help of equation (6), equation (29) leads to the condition 


= a3 (30) 
or 
po = ksine (31a) 
B =k cosa (31b) 
with 
cosa = wpe = -. (32) 


For x > 0 and z > 0 only the + sign in equation (80) is possible. This 
is an important point. It shows that even though the radiation modes, 
equations (24) and (25), represent standing wave patterns in x-direction 
only, the outward traveling part of the decomposition of the standing 
wave into traveling waves makes a contribution to the radiation field, 
equation (27). 

All terms of the integrand with the exception of equation (28) can be 
taken out of the integral. The remaining integration can be carried out 
using the expansion 


; 1 
Bz + px = k(x sina + 2 cosa) 5 et Po) ote 


3 
COSa@ -; i 
z ; € tk(z sin atz cos a (33) 


ae paris dp > (l + a(n)? a ) 


The far field is therefore obtained in the form 


COS Kod 


(s.a + £2) 
Yo 


5 ue Tw) 3 Mn? 
£, = Gn! exp (i «| ak?(wpyP)?(n, — 1) 


sin (0 — 6) 5 


_ po Sin 2Zood — tpooy COS 2ood 
(p3 + 0%) sin 2ood — Zipooy cos 20d 6’ — 6 





-exp co Pa Q) 4 ered ae sin a+z cos anh. (34) 


The index zero was added to o to indicate that it must be evaluated from 
equations (7) and (8) using py» of equation (81a). 
Equation (84) reveals several important features of the far field of 
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radiation. This field is essentially a plane wave traveling in the direction 
of a (tan a = 2/2, and x and z are the coordinates of the point of ob- 
servation). 

The field intensity is inversely proportional to the square root of the 
distance r from (the sinusoidally distorted) waveguide section. The 
dependence on distance is inversely proportional to (r)* rather than r 
because the waveguide is infinitely extended in y-direction (see Ref. 1). 

The main radiation lobe occurs at the maximum value of [sin (@’ — 
6)L/2|/(6’ — 6) that is at 6 = @’ or from equations (5) and (81b) at 


Bo — & 


COS Ay = k 


(35) 
(89 = propagation constant of guided mode). 

The width of the main lobe depends on the length L of the sinusoidally 
distorted waveguide section. The difference in angle between the peak 
of the lobe and the first null determines the half width of the main lobe 


20 
Aa = Tien fora ~ 0. (36a) 
The width of the main radiation lobe is inversely proportional to L. 
The lobe is narrowest for a = 7/2 and becomes wider as a decreases 


toward zero. If the peak of the main lobe is at a = 0, we obtain 


3 
Aa = (27) fora = 0. (36b) 
The peak amplitude of the main radiation lobe is not strongly dependent 
on a. The increase in radiated power in forward direction (a = 0) which 
is apparent from Fig. 1 is caused by the broadening of the radiation 
lobe with decreasing angle. 


IV. CONCLUSION 


The radiation loss of dielectric waveguides caused by deviations from 
perfect straightness of the waveguide walls depends on the “power 
spectrum”’ of the wall deviation function. A sinusoidal wall perturbation 
gives rise to radiation into a particular direction in space. Each Fourier 
component of the Fourier expansion of the wall distortion function is 
responsible for radiation into a particular direction. The width of the 
radiation lobes is wide for scattering directions parallel to the rod so that 
those Fourier components responsible for forward and backward scat- 
tering contribute more to the radiation loss than those causing scat- 
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tering in other directions. However, this preferential loss behavior is not 
very pronounced, so that the Fourier components responsible for forward 
and backward scattering contribute only a small amount of the total 
radiation loss caused by a broad power spectrum. 

The coupling between two guided modes of the dielectric waveguide is 
also governed by equation (5). Only one component of the power spec- 
trum of the wall distortion function influences the coupling between two 
guided modes, while the entire range of mechanical frequencies, equa- 
tion (16), determines the radiation loss. 

The general predictions of this theory have been experimentally veri- 
fied. Microwave experiments on a periodically corrugated teflon rod have 
shown that the radiation losses are negligibly small if the period of the 
corrugation is such that 6 lies outside of the interval indicated by equa- 
tion (16).” However, if 6 falls inside of the interval, equation (16), con- 
siderable radiation losses do occur. The peak of the radiation losses 
shown in Fig. 1 and the direction and width of the radiation lobes have 
also been observed in agreement with this theory. 
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Amplitude Distributions of Telephone 
Channel Noise and a Model for 
Impulse Noise 


By J. H. FENNICK 
(Manuscript received June 30, 1969) 


The noise waveforms found on voice bandwidth telephone channels are 
generally recognized to be non-gaussian in their amplitude distribution. 
This paper presents data which suggests that a simple exponential ts a 
good function to describe amplitude densities in the extreme tails. 

A comprehenisve model of ampulse noise as viewed on trunk groups is 
then presented. The model relates the distributions of impulse noise levels 
and impulse noise counts. 


I. INTRODUCTION 


Noise on telephone channels has been measured for years with in- 
struments which are constructed to enable reasonably good correlations 
between the reading obtained and the annoyance of the noise during a 
telephone conversation.’ Fluctuations of the meter pointer during a 
measurement are either ignored or mentally averaged by the observer, 
depending upon their frequency of occurrence and their magnitude. 
With the introduction of data transmission on the telephone network, the 
relatively frequent high amplitude excursions of the noise waveform 
were viewed as a “new” kind of noise, primarily because they were 
generally not annoying in voice communication and it was recognized 
that no meaningful measure of them could be obtained with the stand- 
ard noise measuring sets. The term “impulse noise’ was applied to 
these high excursions and new instruments were designed to measure 
them.” 

The significance of impulse noise in data transmission has given rise 
to a great deal of effort devoted to its measurement, characterization, 
and evaluation as a transmission impairment.*° (For an extensive 
bibliography, see Ref. 3.) Several models have been suggested to de- 
scribe the erratic behavior and clustering phenomena associated with 
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this type of noise. The Pareto model of Berger and Mendelbrot and the 
generalized hyperbolic model proposed by Mertz appear to be the best 
presented to date.*"’ A more mathematically tractible (than the hyper- 
bolic) model has recently been applied to error rate data by Fritchman. 
He proposed a partitioned Markov chain model which would seem to 
show promise in this area although it does not seem to have been ap- 
plied to impulse noise data as yet. The model presented here does not 
deal specifically with the intervals between occurrences of noise pulses 
but is concerned directly with the number of occurrences per unit 
time above any threshold (in decibel) of observation. Extrapolation of 
occurrences of noise pulses to errors created in data transmission is a 
function of many parameters besides the occurrence of noise and will 
not be discussed here although good prediction techniques exist.” 

In order to set the background for the discussion of impulse noise as a 
separate phenomenon, as opposed to the background noise or as a part 
of the composite noise waveform on a channel, data are first presented 
on the amplitude probability density function of the noise as observed 
and comparisons made with gaussian noise. The data reflect only the 
range of variables encountered and should not be considered as statis- 
tically describing the amplitude distributions of noise on telephone 
channels. 


II. IMPULSE NOISE AS A DISTINCT PROCESS 


Typical oscillograph noise waveforms from a random noise generator 
and from a telephone channel are shown in Fig. 1. Each trace is 200 ms 
long and both have the same rms value. The upper one is from the noise 
generator, the lower one from a telephone channel. The occurrence of 
two “impulses” are shown near the left end of the lower trace. It is 
primarily the occurrence of such “pulses” that make real channel noise 
decidedly different from band-limited white gaussian distributed noise 
(the upper trace). 

Figures 2a and b show two such impulses extracted from a noise 
recording, sampled at a 15 kHz rate and analyzed to determine their 
amplitude and phase characteristics in the frequency domain. In both 
cases, the phase characteristic is shown to be relatively smooth, but the 
frequency content highly variable. Similar analyses on about 2000 noise 
pulses verified these observations. However, if a large sample of pulses, 
on the order of 200, is taken from a given channel, the average spectrum 
appears to be approximately the shape of the channel gain-frequency 
characteristic—not a very surprising result. Such an averaging is shown 
in Fig. 3. 
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Fig. 1— 200 ms samples of random noise and telephone channel noise with 
equal rms levels. 
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Fig. 2—Samples of impulses extracted from telephone channel noise with 
their amplitude and phase characteristics in the frequency domain. 
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Fig. 3— Average spectral content of about 200 impulses from a single tele- 
phone channel. 


Figures 1 and 2 serve as partial justification for treating impulse noise 
as a separate phenomenon. The pulses shown in Fig. 1 do not rise to 
strikingly high amplitudes compared to the rest of the noise waveform. 
Those in Fig. 2, however, are so large that the scale prohibits viewing the 
background noise waveform which continues beyond that shown. This 
extreme peaking will become more apparent in Section III. 


Ill. PERCENT OF TIME WAVEFORM IS WITHIN AN INTERVAL 


The percentage of time that the noise is within a given interval 
(+0.5 dB in this case) is a useful means of describing a random wave- 
form. Data in the form of histograms were obtained by sampling, at a 
10 kHz rate, 30 minute tape recordings of telephone channel noise. 
Equipment limitations imposed a usable dynamic range of 30 dB, so 
the apparatus was adjusted to examine only the extreme peaks of the 
noise. In practice this usually required that the noise be examined at 
levels corresponding to percentages of 10~” or less. This approach was 
also consistent with the nature of the problem—the relatively high noise 
amplitudes were of greatest interest. Logarithmic compression and 
decibel scaling were used and resulted in a unique presentation of the 
data. Instead of the usual scaling in voltage, the abscissa is scaled in 
decibels removed from the rms value of the noise. A negative sign pre- 
ceding an abscissa value refers simply to one polarity of noise waveform, 
a positive sign refers simply to the opposite polarity. Zero on the ab- 
scissa corresponds to the rms value of the noise waveform. For con- 
venient comparison, the equivalent data for a gaussian distribution 
are also shown in each of the figures presented. The ordinate, proportion 
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of time the waveform is within +3 dB of the indicated level, is presented 
in powers of 10 from 107? to 10™*. 

Figures 4 and 5 show the histograms as measured on two different 
channels. Figure 4 was taken from data recorded on a coaxial cable sys- 
tem and Fig. 5 from a microwave radio system. The striking departure 
from a stationary gaussian process is obvious. The sampling rate of 
10 kHz over a 30 minute period resulted in 18 X 10° samples. Values 
on Fig. 4 of 5.5 X 107° represent one sample in 18 million and can 
hardly be considered significant. The values of 10° shown on Fig. 4 
represent voids in the data. Figure 6 shows a histogram constructed by 
combining seven 30 minute recordings, and so represents an “‘average”’ 
histogram over 3.5 hours of real time. The result is surprisingly linear 
for values below about 5 X 10°° and suggests that the tails of the am- 
plitude distribution of real channel noise are approximated quite well 
by a simple exponential. 

A total of 37 half hour recordings were analyzed in this fashion. 
Seventeen of these were taken from microwave radio channels and 20 


{ 
nN 


























FRACTION OF TIME ABOVE ABSCISSA IN POWERS OF 10 
t 
te) 





| 
@ 


-50 -40 -30 -20 -10 oO 10 20 
HISTOGRAM OF NEGATIVE AND POSITIVE NOISE 
WAVEFORM IN GB REMOVED FROMIMS VALUE 


fe) 


Fig. 4— Histogram of telephone channel noise amplitudes compared with 
gaussian distribution. Sample from a coaxial transmission system. Proportion of 
time that the noise waveform is within +0.5 dB of abscissa value. Abscissa is 
decibel removed from rms. 
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from various types of cable or coaxial carrier systems. The variability 
observed on the microwave systems is much greater than that on cable 
systems so the two sets of data are treated separately. 

Since no data are available on the amplitude histograms at values in 
excess of 10~*, it is assumed here that the histogram for such values is 
represented by a truncated normal function. The observed data suggest 
that, if the noise is stationary, its amplitude density function then may 
be written: 


0; z<—b 
ce’; —banzs-a 
p(x) = 74; -a<au<a (1) 
ce"; Cae 6 
0; x>b 


where 


-+kb = realistic bounds on the voltage waveform (channel saturation), 
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Fig. 5— Histogram of telephone channel noise amplitudes compared with 
gaussian distribution. Sample from a microwave radio system. Constructed as 
in Fig. 4. 
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Fig. 6 — Histogram of noise amplitude from cable and coaxial systems taken 
over 3% hours. Constructed as in Fig. 4. 


-+-a = points of departure of the density function from an assumed 
underlying Gaussian, 
$ = gaussian density truncated at -ka. 


c, k = parameters describing the exponential density function. 


The value of b ranges from about 30 to 50 dB*, the value of a ranges 
from about 10 to 15 dB*, and k may be negative as illustrated in Fig. 5. 
The variable c ranges over 14 orders of magnitude from 10~’ to 10’. 
No significant correlations were found between any of the variables 
in the data analyzed. The point of departure from the assumed under- 
lying truncated gaussian distribution a is given by the positive solution 
of the quadratic 
a=k + {k? — 21n [c(2z)*]}}. 


Some values of k and ¢ are given below. 


3.1 Histograms on Cable and Coaxial Carrier Systems 


As stated earlier, the histograms on cable and coaxial carrier systems 
showed less variability than those on microwave radio systems. In fact, 


* That is, above the rms value. 
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two of the 20 observations tracked the assumed gaussian distribution 
to within less than 3 dB over the entire range from 107° to 107’. These 
two observations lend credence to the assumption of an underlying gaus- 
sian process and also show that at least two channels had no impulse 
noise in the sense of the term as defined in Section I. 

The data are summarized in two ways. First, the values of the vari- 
ables k and c were examined, and then the intercepts (abscissa values) 
for various values of proportion were studied. 

For cable and coaxial carrier systems (20 samples)* the mean of k 
was 0.45 and the estimated standard deviation s was 0.46. Because of 
the extreme range of c, as mentioned in Section III, only the median 
appears to be of interest; it was found to be 0.0028. A probability density 
function using the mean value of k and the median of c is shown in Fig. 
7. The resultant exponential departs from the guassian distribution at 
about 4.5 X 107° on the ordinate. 

The second method of examining the data is considered to be more 
meaningful in terms of a representative average. The intercepts at 
proportion values of 107* to 10°’ were studied. The mean (z),, (in dB), 
estimated standard deviation s, median, and 90 percent confidence 
intervals (CI) about the mean, are shown in Table I. The average and 
median functions so derived are also shown in Fig. 7. The distributions 
of intercepts were found to be very nearly log-normal for all four pro- 
portion values (10~* through 107’). This explains the differences between 
the means and medians as in Table I and Fig. 7. A skew distribution of 
the intercepts is of the form to be expected. A lower bound on the inter- 
cept is imposed by the gaussian assumption and a gradual tailing off of 
the values at the high range might be expected. 

Taking the median value of the exponential distribution as being a 
representative value of conditions on cable carrier systems, it is of 
interest to compare tail values of the resultant cumulative distribution 
function (CDF) with the gaussian distribution. The median exponential 
intercepts the gaussian at an x value 12.6 dB above the rms. This cor- 
responds to the log™* (12.6/20) = 4.260 point. If the noise amplitude 
were truly gaussian, only 0.004 percent of the waveform would lie beyond 
the +-4.26¢ points. However, 0.0134 percent of the area lies below the 
exponential portion of the density function, nearly a full order of magni- 
tude difference. This sheds some light on the predicted performance of 
data systems, for instance, in the presence of gaussian noise, a typical 
analysis situation, and that actually observed in a working version of 
the system over real channels. 


* Hach “sample” is 30 minutes of time. 
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Fig. 7— Amplitude probability density functions. Estimated average and 
median taken over constant density values and derived from mean parameter k 
and median parameter c, compared to gaussian. Averages taken over 10 hours 
of noise on cable carrier systems. — gaussian; —-— average; —-—-— median, 
—--— mean k, median C 


3.2 Histograms on Microwave Radio Systems 


The data for the microwave systems were analyzed in the same way 
as the second method for the cable systems. The individual values of 
k and c were not computed because of the dubious value of such an 
effort. Averaging over the intercepts for constant values of density 
(method 2) illustrates the greater variability in the microwave systems. 
The results, presented in Table IJ, show this by the larger estimated 
standard deviations and wider 90 percent confidence intervals about the 


TasBLeE I—EsTIMATED PROBABILITY DENSITIES FOR CABLE 
AND COAXIAL CARRIER SYSTEMS 


Probability (aey 8 Median | 90% CI 
Density dB) (dB) (dB) (dB) 
10-5 19.3 4.4 15 1.5 
10-6 24.6 4.9 23 1.8 
1077 30.2 5.5 28 2.0 
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TABLE JJ—EstTIMATED PROBABILITY DENSITIES FOR 
Microwave Rapio SyYstEMs 


Probability (ee 8 Median 90% CI 
Density dB) (dB) (dB) (dB) 
10-4 17.7 10.7 12.6 5.5 
10-5 23.2 12.8 18.5 5.2 
10-6 24.7 9.7 22 4.1 
10-7 29.6 9.4 27 4.1 


estimated means (compare with Table I). The median function so 
derived is shown in Fig. 8. The distributions of intercepts were again 
found to be closely approximated by the log-normal distribution and 
the median curve examined as for the cable carrier systems. The median 
exponential intercepts the gaussian distribution at 11.6 dB above the 
rms value. This corresponds to log’ (11.6/20) = 3.8c, or 0.0165 percent 
of the noise waveform that would lie beyond +3.8c of a gaussian dis- 
tribution. The values of k and c for the median curve on Fig. 8 are 0.48 
and 0.042. Integration of the resultant exponential function over the 
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Fig. 8— Amplitude probability density functions. Estimated median taken over 
constant density values from 8% hours of noise from microwave radio systems. 
— gaussian, —-— average, ——-—- median. 
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appropriate intervals yields 0.068 percent of the median waveform in 
excess of the 3.8 points of an assumed gaussian. In this case, the median 
difference is about a factor of four. 


IV. FORMAL DEFINITION OF AN IMPULSE AND SOME PULSE LENGTH DATA 


In the process of impulse noise analysis a formal definition is required. 
The definition is illustrated in Fig. 9 and was first proposed by Kaenel, 
and others.° The waveform illustrated in Fig. 9 represents an ideally 
rectified noise waveform being sampled by an A/D converter. All 
portions of the noise waveform that remain below a variable slicing 
level, designated level 2, are considered as part of the underlying band- 
limited white gaussian or background noise until level 2 is exceeded. 
Once level 2 is exceeded, the noise pulse, or impulse, is measured starting 
at the point where level 1 was exceeded as indicated in the figure until 
it returns below level 1 and remains for a specified amount of time 
referred to as a guard interval. The function of the guard interval is 
to distinguish between nodes of a single impulse and two impulses which 
occur close together in time. Various guard intervals have been used in 
the analysis of voiceband impulse noise, from 0.3 ms to 0.8 ms. The 
choice is somewhat arbitrary, but on the basis of the author’s unpub- 
lished interpulse time distributions, his choice is 0.6 ms as an optimum 
value. This is preferred because interpulse gap length histograms com- 
monly exhibit a null at about 0.6 ms. The adjustment of levels 1 and 2 
vary, but level 1 is typically 10 dB above the rms value, and level 2 
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Fig. 9-—Ideally rectified noise waveform illustrating definition of pulse length. 
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from 18 to 16 dB above rms. In the context of this formal definition, 
the impulse has been referred to as a burst.° 

Under the rules of the definition, just given, frequency functions were 
constructed for the lengths of several thousand impulses. A set of these 
are shown in Fig. 10; the set is shown as “envelopes of all the observed 
frequency functions.” Only two points appear to be significant. The 
modes of the functions occur at about 1.2 ms, and lengths in excess of 
10 ms are almost never observed. The remainder of this paper discusses 
an impulse noise model. 


V. A MODEL FOR IMPULSE NOISE ON TELEPHONE CHANNELS 


This section describes impulse noise as viewed on a trunk group as 
it is used by a switched network subscriber. The distributions of the 
peak amplitudes of individual impulses have been of interest for some 
time, and extensive data concerning them have been collected.*~** 
Methods of relating such distributions to data system performance have 
also been derived.* 

The data are most frequently collected by means of simple threshold 
detectors. Excursions of the noise waveform above the threshold are 
recorded on electromechanical counters.” Such measuring devices have 
finite counting rates which may be exceeded at times by the rate of 
occurrence of impulses in clusters. For this reason, from this point on, 
“counts” referring to values recorded by the instruments will be used 
instead of the word ‘“‘impulse.’”’ The count process necessarily differs 
in some respects from the impulse noise process for the reasons just 
cited. 


5.1 Terminology and Definitions 


Some jargon has accumulated in the area of impulse noise studies; 
it is sometimes conflicting as well as confusing. The following termi- 
nology is adopted here. 
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Fig. 10 — Envelopes of length density functions derived under definition of Fig. 9. 
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(4) Count—Refers to a number registered on the counter of an im- 
pulse noise threshold detecting type of measuring instrument, set at a 
specified level, during a specified measurement interval. (The count 
may be less than the actual number of impulses which exceeded the 
measurement threshold during the interval because of the finite maxi- 
mum counting rate of the instrument.) Upper case C denotes the random 
variable count. 

(it) Impulse Noise Level—A level, expressed in decibels, at which 
the recorded count in a specified measurement interval is equal to 
some specified count denoted C, . 

(22) Level Distribution—A distribution of levels, expressed in decibels 
(dBm, dBrn, and so on), taken across a number of channels, at which a 
specified count C, is recorded in a specified measurement interval. 
Script ‘¢’’ denotes the random variable level. 

(iv) Count Distribution—A distribution of counts observed in 
measurements on a number of channels taken at a specified level. 

(v) Log-Count Distribution—A distribution of the logarithms of 
counts, expressed in decibels. Upper case “‘D’’ denotes the random 
variable log-count and is defined: D = —10 log,, (C/C.), where C, is an 
arbitrarily ‘‘specified reference count” greater than zero. C’, is arbitrary, 
but once picked it must be held constant for its associated level distribu- 
tion. 

(vi) Amplitude Distribution—A cumulative distribution of the peak 
amplitudes of individual impulses on a single channel. The average 
complementary distribution is linear on semi-log paper for counts in 
the range of interest; that is; C < 800 in 30 minutes. 

(viz) Slope—When spelled with “‘S’’, Slope refers to the slope of the 
peak amplitude distribution. Through common usage, the number 
assigned to Slope is the negative reciprocal of the slope of the peak ampli- 
tude distribution, designated ‘‘m’’, and expressed in decibels per decade 
of counts. 


5.2 General Comments on Level and Count Distributions 


Sample level distributions are constructed from data obtained through 
the use of multilevel impulse counters which record the number of 
counts at several levels, each separated by 2 to 6 dB, occurring during 
a prescribed measurement period. Level distributions may be con- 
structed from the data depending upon the specific number of counts 
C, in which one is interested. Suitable interpolation between the levels 
actually observed permits one to estimate the level at which some 
specified number of counts C, actually occurred. Thus each level dis- 
tribution has a number C, associated with it, as well as a specific meas- 
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urement interval. The primary data used in this study consists of level 
distributions of 15 counts in 15 minutes and 90 counts in 30 minutes.*”® 

The number of counts observed in a multilevel measurement tends 
to decrease exponentially as the level, in decibels, increases. Some de- 
partures from this rule are observed in individual measurements, but 
the average amplitude distribution taken over a large number of mea- 
surements in a single class of trunks appears to be exponential.° The 
number of counts C, at any level £, may be estimated from the number 
of counts C’, at level £’, by the empirically derived relation 


C = C’ exp [(’ — £)/(Mm)]. (2) 


where M = (log, 10)~*. Because different types of transmission facilities 
exhibit different impulse noise properties, the average noise level and 
average Slope vary over an appreciable range as facilities change. 
However, within a given type of facility or within a class of trunks, 
greater homogeneity is observed.° The model is therefore directed at a 
description of the noise as observed within populations of transmission 
channels on a single type of facility which are common to some larger 
grouping such as a trunk group. 


5.3 Assumptions 


The following two assumptions, supported by studies of available 
noise data, are basic to the model which is presented in Section 5.4. 


(z) Level distributions for a specified count C’, are normal with mean 
£, and standard deviation o,. 

(tt) o- 1s independent of C, within a given trunk class. The first 
assumption is the most reasonable in view of the data; there are con- 
flicting data concerning the second and it appears to be more valid for 
compandored facilities than for noncompandored facilities.‘ Under 
these assumptions, and one more stated below, it is shown below that 
the count and level distributions are completely described by the 
parameters associated with one level distribution: C,, f,, o,, and the 
Slope m. The Slope is estimated by the straight line connecting the mean 
of the level distributions for different choices of C, .° 


5.4 The Model 


Any number of level distributions may be obtained from the data by 
choosing different values of C,. As C, increases, the corresponding 
level £, will decrease and trace a path in the count-level plane given by 
equation (2). A family of such level distributions form a probability 
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density surface above the count-level plane with normal cross sections 
parallel to the level axis. Such a surface is illustrated in Fig. 11. Under 
assumption (zz), lines parallel to the mean Slope are projections of con- 
stant probability density with the same functional form as equation (2). 
One of two cross sections may be taken which will define a probability 
density function. If the cross section is parallel to the count axis, a count 
distribution results. To see this more clearly, consider an experiment 
where impulse noise measurements are made on a group of similar 
trunks. A value for C, is chosen and the associated level distribution 
with mean ¢, is found. The distribution will be normal with standard 
deviation o, . Another value of C, is chosen and a second level distribu- 
tion is constructed. It will have the same standard deviation as the first. 
The experiment may be repeated any number of times to construct the 
family of distributions illustrated in Fig. 11. 

In the experiment just described, the noise level ¢ associated with C, 
was the random variable. Now suppose one wishes to let the count, or 
log-count, be the random variable while holding ¢ fixed. It is noted 
that equation (2) is the relationship between the means ¢, and C,. 
Assume for the moment that equation (2) holds completely and is indeed 
a fixed relation between the two possible random variables, ¢ and C. 
Equation (2) may be rewritten, with ¢’ = 4, = 0 and C’ = C, as this 
constitutes an arbitrary shift in the decibel scale to define ¢’ = 0: 
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Fig. 11— Probability density surface for the impulse nOise count process on 
trunk groups. 
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£ = —m logi (C/C.). (3) 
Define D = —10 log (C/C,). Then ¢ = Dm/10, and the log-count 
distribution, the probability that D is S$ some value z, is by assumption 
1, 
1 zm/10 
PDea = Pee cnjij=— = / exp (—@/22) dé. (4) 
o ¢(2)’ —0 


The density function f(x) is found to be 


IIA 


f) = ipp api HP [—m"a*/2000%];  — esas o. 6) 
Thus D is approximated by a normal distribution with mean zero and 
standard deviation op = 100,/m.* 

In the previous derivation, equation (2), a relationship between 
expected values was assumed to hold as a mapping between the random 
variables ¢ and C or ¢ and D. To check the validity of this assumption 
a second experiment can be performed on the data collected in the first. 
The level ¢ can be held fixed at ¢,, and the count distribution at f, 
obtained by interpolation as described earlier. The observed log-count 
distribution may be compared with that derived in equation (5). This is 
done in Section 5.5. 


5.5 A Check on the Model Using Count Distribution Data 


Figure 13 is an example of count distributions derived in three dif- 
ferent ways from a set of data consisting of 127 measurements on non- 
compandored carrier facility trunks 1,000 to 2,000 miles in length. The 
level distribution for these data, with C, = 15, is slightly skew, the 
mean is 6.129 dBrn and the median 61.8 dBrn. The count distribution 
at 61.8 dBrn, obtained by interpolation between levels measured, is 
shown by the circled points on the figure. A point-by-point mapping 
from the level distribution by use of equation (2) is shown, as well as 
the log-normal one predicted by equation (5). The coincidence of all 
three sets of data is striking. 


VI. THE TIME VARIABILITY OF IMPULSE NOISE 


An additional check on the validity of this model is provided by its 
implications in the time variability of the noise. To see this, one addi- 
tional assumption is made, and predicted and observed results serve to 


* As a matter of interest, values of op calculated from the 1964 Intertoll Trunk 
Survey (Ref. 6), are shown in Fig. 12. 
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Fig. 12— Values of standard deviation of log-count distributions F(D) as 
observed on Bell System trunks. 


validate both this additional assumption and the preceding model. 

Consider making impulse noise measurements on a large number of 
channels at a fixed level ¢, and recording the cumulative count on the 
ith channel C,; at times n7’, n = 1, 2, --- . Now assume that the ac- 
crued count is a linear function of time so that each total count C,; 
after time nT’ may be estimated by C,,; = nC,;, where C;; is the count 
in the first interval 7 on the 7th channel. If the same reference count 
C, is retained in the definition of D (log-counts), for all time intervals, 
then the mean value of D will increase as log (n) but the variance of the 
count distribution (as opposed to the log-count) behaves differently 
however, as shown by the following. 

Under the assumption that equation (2) holds as a mapping between 
£ and C, the distribution of C (counts) may also be derived: 


PIC Sy) = Pl = —m log (y/C.)] 





Eek. tal i __ pyo2 
= Fant dom weve, exp (—£°/20;) dé, (6) 
and the density of C is approximated by the log-normal: 
= Mm =9 | en 2 | < 
fy) = (amt 4 &%P dgz In Y/Co) | 
OSyS %, 
In = log, . 
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Fig. 13 — Empirical verification of the distribution of counts as derived from 
distribution of levels of constant counts. Distribution of counts in 15-minute 
measured. periods on one class of trunks. © as measured; A constructed from 
level distribution ; — predicted from equation (5). 


The rth moment of C is then found to be 





2.2 
BIC] = C2 exp b'/(4a)); a = ew OP, @ 
On Op 


and the variance, 2 = C2(e’”* — e’’**) = C?A. Now, as the measure- 
ment interval is increased as above, C, is replaced by nC, and o¢(nC,) = 
n’C?A. The variance of the count distribution increases as the square 
of time if the mean increases linearly. 

Note from equation (7), that the mean of the count distribution is 
not equal to the reference count C, which is associated with the level 
distribution. The two are related as* 


(Chae = C. exp [o7/(2m*M”)] 
= C,(1.027)’. (8) 


Thus, the mean of the count distribution at level £, is always greater 
than the reference count C,. Furthermore, from the definitions of the 


* Note from Fig. 12 that op may be as large as 13.5 so (C)ay may be as large as 
130 times C.. 
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level distribution and the quantity D, C, is equal to the median of the 
count distribution. Solving equation (8) for op yields 


op 8.7 (loz (C)./C,)}, 


and an estimate of the variance of the log-count distribution may be 
made from the mean and median of the count distribution. This rela- 
tion should be very useful in practice. 

Now consider measurements of length K,7' taken on a number of 
channels with the counts recorded after K,7 and K.T, K, > K,. Let x 
be a random variable that takes the value of the count at K,T, and y 
one that takes the value of the count at K.T. If it were true that the 
count on each channel is a linear function of time, then for the ith 
channel measurement, y; = (K2/K,)x; and the coefficient of correlation 
pPzy = 1. Such correlation coefficients were calculated for several sets 
of data. The results are presented in Table III. 7’ was equal to 5 minutes 
in all cases. The notation p;; indicates the correlation between the counts 
at the end of 7 5-minute intervals with that after 7 5-minute intervals. 
The mean ratio of the count after 7 intervals to the count after z intervals 
and the ratio of the variance after j and 7 intervals is also given. The 
expected values, derived from the model, are given in each case (in 
parentheses), as well as the observed values. While the correlation coef- 
ficients are not all as close to unity as one might hope, especially for the 
5-minute versus 30-minute measurements (¢ = 1, 7 = 6), the mean and 
variance do appear to increase directly and as the square of time re- 
spectively. 

On the basis of the data shown in Table III and Fig. 18, the model 
appears to be an adequate description of the observed behavior of the 
impulse noise on transmission facilities as viewed through impulse noise 
measuring sets. 


TasuEe ITJ—CorRRELATION COEFFICIENTS AND Ratios 
oF MEANS AND VARIANCES* 


Sample 
1,9 piz By /pi 3;?/si2 Size 
1,2 (1) 0.87 (2) 2.04 (4) 3.70 87 
1,2 (1) 0.92 (2) 1.97 (4) 5.00 76 
1,2 | (1)0.90 | (2)1.98 (4) 3.98 216 
1,3 | (1)0.96 | (3)3.10 (9) 9.90 161 
1,3 (1) 0.98 (3) 2.90 (9) 8.50 168 
2,4 (1) 0.97 (2) 2.06 (4) 3.60 93 
1,6 (1) 0.58 (6) 6.76 (86) 46 161 


* For counts observed after 7 and j 5-minute intervals. Predicted values in 
parentheses are followed by observed values. 
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VIT. SUMMARY 


The following relations and conclusions come from the model pre- 
sented and the data upon which it is based. 


(i) Level distributions are normal with mean ¢, and variance o7. 

(77) Count distributions are log-normal with mean (C),, which is 
linearly related to the length of the measurement interval, and variance, 
o¢, which is proportional to the square of the interval. Equivalently, 
log-count distributions are normal with mean proportional to the 
logarithm of the measurement interval and variance, o5 , independent 
of interval. 

(77t) o, is dependent upon the class of trunk but is independent of 
C, , an arbitrary reference count greater than zero. 

(tv) op = 10c,/m, m is a measure of the slope of the distribution of 
noise peak amplitudes. 

(v) op & 8.7 (login (C)av/Co)?. 

(vi) (C\ay & C,(1.027)7>" 

(vit) The mean of the count distribution, (C), = C,e'’“” and the 
variance 





2 2 
V(C) = C%h/ (20) [eve ih = us ; HT = log, 10. 

(viii) The median of a count distribution, taken at level ¢, , is equal 
to C, and the mean (C),, , may be 100 times C, . Expected count by 
itself is accordingly a very poor statistic for describing impulse noise. 
However, the mean and the median completely describe the count or 
log-count distributions. 

The model helps to explain the apparent erratic behavior of impulse 
noise measurements. Any measurement is a sample taken from the bi- 
variate sample space illustrated in Fig. 11. The fact that the distribu- 
tion of counts is log-normal also accounts for the great fluctuation in 
the count observed on successive measurements on a given channel. 
It is shown however that the average rate of occurrence is reasonably 
constant with time for intervals from 5 to 30 minutes. 
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The Determinability of Classes of 
Noisy Channels 


By LEONARD J. FORYS 
(Manuscript received June 30, 1969) 


This paper is concerned with the identification of a fairly general class 
of nonlinear operators using corrupted measurements. A precise mathe- 
matical definition of identification 1s presented and the relationship between 
a priort information and identification is studied. The a priort information 
is represented as a subset of a metric space of nonlinear operators. Neces- 
sary and sufficient conditions are developed to answer the question ‘‘When ts 
identification possible?” 


I. INTRODUCTION 


A large body of literature already exists for the problem of identifying 
a control system or communication channel with noisy measurements. 
In the usual identification problems, a certain structure is assumed at 
the outset in order to reduce the identification problem to one of param- 
eter estimation. The absence of such parametrization increases the 
difficulty of the problem substantially. It is often not clear if identifica- 
tion is even possible. 

In this paper we are concerned with the determinability (identifi- 
ability) of quite general nonlinear operators whose outputs are corrupted 
by additive gaussian noise. We introduce a norm on this space of non- 
linear operators and define precisely what we mean by determinability. 
Loosely speaking, we say that we can determine an operator H if we 
can choose a finite observation interval [0, 7], a test signal with con- 
strained peak value over this interval, a finite set of linear measurements 
over [0, 7], and an estimate H of H which is a continuous function 
of our measurements such that 7 is close to H in norm with high proba- 
bility. 

The question of determinability is of course intimately related to the 
kind of a priori knowledge one has of the operator. We represent this 
a priorz information by saying that the operator H belongs to a subset 
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®D of possible operators. We derive conditions on D which are sufficient 
for determinability. We also show that most of these conditions are in 
fact necessary for the determination of H. 

Our results are motivated by the work on the determinability of 
noiseless channels done by Root, Prosser, and Varaiya.’~* They derive 
necessary and sufficient conditions to estimate a noiseless channel 
closely with a “one-shot” experiment. These conditions are similar to 
those presented here. Some work on the noisy problem has been done by 
Root.’ His approach and results are fundamentally different than those 
presented in this paper. Root investigated a class of stochastic nonlinear 
operators represented by a Volterra series whose kernels are gaussian 
random variables. He derived necessary and sufficient conditions for 
the second moments of the kernels to be determinable. 


II. PRELIMINARIES 


The types of channels to be considered can be described as follows. 
The input signal + and observed signal w are related via the operator 
equation 


w(t) = [Hz] +2) te (0, ~) (1) 


where H is an operator and z is zero mean white gaussian noise’ with 
covariance Hz(t)z(r) = 6(¢ — 7). (The colored noise case will be treated 
separately in Section V.) 

We constrain our input functions x to have peak value less than s, 


+ The noise term 2(¢) in equation (1) must be interpreted symbolically since white 
noise cannot be parametrized with a time variable, but must properly be param- 
etrized with an element of a space of “testing functions.’? However, we deal only 
with functionals of w(t) of the form 


[woe a 


where ¢ « L2(0, 6), or with quantities derivable from these functionals. Hence we 
can define 


i " alo(f) dl 


to mean 


[ sma 


where g@ is Brownian motion and the operations to be performed are readily 
justified. 
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that is, . 
zeL,(s) = {x | is a real valued measurable function on [0, ©) 
and | x(t) | < sfor allée [0, ~)}. 


If we let |] ||. denote the norm on L,[0, ©) and define the projection 
operator Pz by 


[P-z|\() = 2) for ts T 
= 0 for ¢>T 
then 


| Pre |, = (f° x°(t) at) <s(T)' forall xe L.(s). 
The types of operators which we consider are assumed to belong to 
the space 3. The space 3¢ is defined: if H ¢ 5¢ then 

(it) H: L,(s) > La, 

where L,, = {y | y is a real valued, measurable function on 

[0, ~), || Pry |l2 < © forall T > 0}, 

(2) H is causal; that is, for all 7 > 0, ae L..(s), PrHx = PrHP7rx, 
(iit) || H || < . 


Using the usual definitions of addition of operators and multiplication 
by scalars, the norm of H, || H || is defined as: 


PH |\2 
Hi\|= su f . 
i «Eee re 
zeLe(s) 
||P rz||20 


We consider H to be the zero operator" if || H || = 0. It is then easy 
to show that || || satisfies the norm axioms. Obviously || H || = 0 for 
all H « # and || AH || = || || W || for all scalars \. The triangle in- 
equality is also satisfied since 


|| H+ K || =sup ee + Kz) ||. ap P,Hxz + PrKz ||. 


P rx ||» || Pra ||o 


t The equivalence classes defined in this manner are not unreasonable. In fact, 
if Hl H || = 0 then || PpHz ||2 = 0 for all ze L,(s), || Pra |l2 4 0 and all T > 0. 
As far as we are concerned this is the zero operator since Hz is then the zero function 
in the £2(0,  ) sense. 
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lA 


| P,Hz ||, + || PpKx | 
or || Pre ||; 


P,Hx [| PrkKa {I> 


= on LPoHe [|p 
= UP Pe lee Pee lle 


where the supremums are taken over all 7 > 0, xe L.,(s), || Prv ||, ¥ 0. 

If we consider the metric induced by the norm || || then 3¢ is a com- 
plete metric space. The proof of this proposition is contained in the 
appendix. The completeness property is crucial to Theorem 2 of this 
paper. 

The space 3C includes many types of operators familiar to those in 
communication and control theory. Linear time invariant convolution 
operators whose kernels are either in L,(0, ©) or L.(0, ©) are in &. 
If these operators are cascaded with a memoryless nonlinearity having 
bounded slope, the composite operators are also in 3C. Operators des- 
cribed by certain nonlinear dynamical systems are also in 3C. Let wv « 
L..(s) be the input to the following dynamical system and let y be the 
output: 


= || A || + [Kl 


git) = fq, 2, ), — q@) = 0 
f:R°XRXR-R 
y(t) = g(q(d)) 
g:R"-R 
with 
lW@|<Kilgl 11@%,0|SKelqg\+Ks|o| 


for all ge R”, |x| <s,t > 0. Assume also that for each xe L,,(s) there 
exists a solution to the differential equation. Then, via the Bellman- 
Gronwall inequality we see that 


| q(t) = Ks [ es | a(r) | dr. 


([" |g at) < KK, ie =, é 


([" ly | at) < K,K.K; || Pre |\e . 


Hence, 


and 


Thus, the operator described is in 3 with norm bounded by K,K.K;. 
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Subsets of 3C will be used to represent the a priori information in an 
identification problem. We call a subset D of 3C determinable if every 
member of © can be identified. The determinability of a subset D de- 
pends of course on our definition of identification. We would like to 
consider only those identification procedures which could theoretically 
be implemented in real time. The identification procedures which we are 
concerned with must have the following properties. To identify H we 
must be able to 


(z) choose a finite observation interval, 
(c7) select an input function with constrained peak value, 
(277) perform linear measurements on the noisy observations generated 
by this input, and 
(7v) operate on these measurements to yield an estimate of H which is 
a continuous function of these measurements, 


so that our estimate of H is close to H with high probability. 

The properties of such an identification procedure are physically 
very appealing. We obviously must be able to identify within a finite 
period of time. The peak value restriction is the usual kind of input 
constraint used in communication theory. Linear measurements are 
easily implemented and tend to reduce the sensitivity to unknown 
biases as does the continuity requirement on the estimate. Finally, 
we are usually satisfied to identify to within a small tolerance. 

For H « % and channel model given by equation (1) we may specify 
our definition of identification even further. A linear measurement over 
the time interval [0, 7] is a finite collection of bounded linear func- 
tionals' (p;, w), 7 = 1, 2, --- N, p; ¢ L.[0, T] defined when Prw e 
L.[0, T] and 

wt) = (Hz]O® +20, OStsT 
is the received waveform with H « #, xe L,(s). We say that a class 
® C & of channel operators is determinable if given arbitrary positive 
constants « and y, there exists a finite observation interval [0, 7], an 
input (test) signal ve L,.(s), a linear measurement [(p; , w), (p2,W), °*° , 
(py , w)] over [0, 7'], and a continuous function g : RY — 3¢ such that 
for each H ¢ 9, . 


Probability (|| H — A || > —) <7 
where H = g[(p,,w), (p2,w), ---, (py , w)]. Thus, if D is determinable, 


+ The symbol (f, h) is used to represent the inner product in L2[0, 7]; that is, 
(f, h) = So" f(t)h(e) dé. 
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we can “identify” any element of D to within any specified accuracy 
with sufficient processing and long enough observation time. 

The bulk of this paper is related to answering the following question. 
What structure must D have in order to be determinable? Theorem 1 
derives sufficient conditions on ®D in order to be determinable. The key 
condition is compactness. Theorem 2 indicates that this condition is in 
fact necessary for determinability. A number of corollaries are given 
which interpret these results for the case where D is composed of linear 
convolution operators. 


Ill. SUFFICIENT DETERMINABILITY CONDITIONS 


Despite the generality of our class of operators and the rather rigid 
nature of allowable identification schemes only two conditions guarantee 
the determinability of a subset of operators. Both conditions are some- 
what obvious. One condition insures that the class may be approximated 
closely by a finite number of elements; the other insures that a test 
signal exists which will produce sufficiently dissimilar responses for 
dissimilar channels. These conditions are rigorously stated in Theorem 1. 
Theorem 1: Let D be a subset of 3C having the following properties: 


(1) the closure of D is compact (thus D ts also bounded; that is, there 
exists a constant R > 0, such that || H — K || < R for all H, Ke SD) 

(it) given any 6 > 0 there exists an unbounded sequence {T;}, a sequence 
of inputs x; « L,,(s) and a positive number r such that 


| Py, (Hx; — Kx;) [|> > rT; 


for all pairs H, Ke 9 for which || H — K || = 6. Then D is a determinable 
subset of 3C. 

Proof: Since the proof of this theorem is lengthy, we give here a brief, 
rough description of the key steps involved which the reader may use 
as a guide through the mathematical details. 


(t) Using (27) of Theorem 1 we select an input 2; to give sufficient 
separation of outputs over [0, 7;] for sufficiently dissimilar channels. 

(ii) We then approximate the class D to within a judiciously chosen 
accuracy by a finite number of elements. 

(zt) The actual received signal due to the input selected in (2) of 
this proof is correlated over [0, T;| with the calculated outputs of the 
channels selected in step (7) of this proof. 

(iv) If one of these correlations is larger than the shor by some 
amount we select as our estimate the corresponding element of the 
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approximating class that yielded this correlation. If there is no such 
correlation we assign an arbitrary rule so as to make the identification 
procedure a continuous function of the correlated values. 

(v) We finally show that as 7 (and hence 7';) increases, the proba- 
bility that there will not be a correlation larger than the others by some 
prescribed amount goes to zero. In addition, we show that the proba- 
bility that our identification procedure yields an estimate which is 
further apart in norm from the actual channel than is desired is vanish- 
ingly small as 7 increases. 

The formal statement of the proof follows below. 

We may assume that D is closed, since subsets of a determinable set 
of channels are determinable. Using assumption (iz) of Theorem 1 
with 6 = 3/4 we have that there exists an unbounded sequence {7';}, 
a positive number 7, and for each 7 an input signal x; « D..(~) such that 
for all pairs H, K e D with || H — K || > 36/4 


| Pr,(Hx; — Kz;) ||} = rT;. (2) 


In what follows we will denote the operator which we wish to identify 
by H. Since 9 is closed, by assumption (2) of Theorem 1, it is also com- 
pact and hence totally bounded (see for example Ref. 6, p. 22). There- 
fore, given any 7; « {7} we can choose a finite number of balls of radius 
ro = min {r?/2s, ¢/4} with centers H, ¢ D, a = 1,2, --- , M to cover D. 
There may be operators H, , H,e« {H..} for which || Pr,(Hiz: — H,2:) ||. 
= 0, in which case retain only the H,’s with the lowest subscript. Thus 
we have a subset of {H,} which we label {Hg} for which || Pr,(Hix; — 
H,x;) ||. > 6; > O for some 6; and all H, , H, « {Hs}. For convenience 
order the {Hs} so that || H — Hg || S 3¢/4forB = 1,2,---,No— 1 
and || H — H,|| > 3e/4for8 =N,,N,.+1,---,N,N SM. 

We can now choose an appropriate linear measurement over the 
interval [0, 7’;,]. We define the linear measurement m(w) = {f(w, 1), 
Ce fw, N)}: fw, 8) = (w, 2H pv;), 8 = 1, 2, -+- N where the 
inner product is defined over the interval [0, 7';]. Thus for each received 
waveform w(t), the linear measurement gives us a point in R”. From 
this measurement we will determine an estimator function g : RY > 3. 
We first partition R” into N + 1 disjoint subsets: A,, Az, °::, Ay, B, 
with 


A; = {@ = (41, a, °++ dy):a; — a, > (Hx; , H;2;) 
— (Ax; , Ayx;) + 6:/T; , k= 1,2,-:- N, k # 3} 


and B the remainder of R”, 
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N c 
B=(Ua4,)- 
7=1 
The disjointness of the above subsets of R” is easily verified by making 


use of the fact that 6;/7; > 0. The estimator function is defined in 
terms of this partition: 


g(m) = H; if m(w)e A; 
N 


g(m) = doa(w)H; if m(w)eB 


i= 


where? 
“) = Taw), A) + Tames), A) 
and 


d(x, A) = inf |x — y |. 
yea 


It is not difficult to show that g is a continuous mapping from R” into 
5x. Having given the identification scheme we now show that for any 
HeX,e>0 


Pi\l|H-HA\|>e > 0. 


Tivo 


Recalling the definition of B, A; and the labeling convention we have 
used, we see that 


Pt|| H — g(mw)) || > ¢} S Pim) e BS + PY mw eV A;} 


= Pim(uye (\ ai + Pimiaye Yah @ 
Let us first concentrate on obtaining bounds for the first term on the 
right side of equation (8). We rewrite A; as A; = (\U,,; Fix)’ where 
Py, = {@ = (@,,@,-++ , Gy): a; — a & (Aja; , H;2,) 
— (A,x,; , Hix;) + 6,/T;}. 
Thus 


+ It turns out that the form of a;(w) is irrelevant since we show that Plm(w) « BY 
vanishes as 7’; increases. It is merely included to make the estimator function 
continuous. 
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N 


N 
(VAi = (VU Fin). (4) 
7=1 7=1 k¥i 

Applying DeMorgan’s rules to equation (4), and after some thought, 
we see that 

N (N-1)% 

NV4;= U D, (5) 

j=1 =1 
where D, has the form 

D, = Fu, (\ Fay, (\ ee ae Fyix 


with J; ~ 7 for all 7. We can upper bound P{m(w) « D,} by 





pe pa = (H jx; , H;2;) = (Hx; ,A,2;) S fiw, k) — fw, )) 
aor : 
< NO6; 
= 7. 


To see this, define g(w, k) = f(w, k) — (A,2; , H,x;). Then P{m(w) ¢ D,} 
is the probability of the N events q(w, 1) — q(w, l1) S 4:/T; , q(w, 2) — 
q(w, lz) S 0,/T; --- qu, N) — qu, ly) S 6;/T; occurring simul- 
taneously. Suppose 1, = k. Then consider the two events q(w, 1) — 
q(w ’ I) e q(w, 1) = qu, k) s 0;/T; and qu, k) ~ qu, lx) s 0;/T; : 
If 7, = 1, then these two events are contained in the event —0;/T; S 
g(w, 1) — q(w, k) S 0,;/T; . If l, = 7 ¥ 1 then consider the three events 





(Ag Hee) (Hx; |Hyx)}- (6) 


q(w, 1) — gw, k) S 6,/T; , 


qw, k) << gw, ) Ss 0;/T; ’ 
qw, i) _ qw, I) = 6:/T; . 


If 1; = 1, then these three simultaneous events are contained in the 
event —06,/T < q(w, 1) — qu, j) S 26,/T; . If l; = k, then these three 
simultaneous events are contained in the event —@;/T S q(w, k) — 
q(w, j) S 0;/T; . Continuing in this fashion we obtain the bound in 
equation (6). 

Since q(w, k) — q(w, j) is gaussian, we can bound the value of the 
expression in equation (6) quite easily. 

Let 


any = E[qw, k) ~~ q(w, 7] 
= || P7(Ha; — H;x) ||} — || Pr(H2: — Aix, ||: (7) 
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and 
ti = Var [q(w, k) — q(w, j)] = 4 || Pe,(Hia; — Hjx;) |\2 > 403. (8) 


ae 





py—Ne < g(w,) — aw. S ca 


NOs/Tiokj—aky/ok; 2? 
= (20)? i exp (-4) dz 


—(N6i/ Tickj)—aki/ok; 


NOi/T ior; 2 
< (2n)? i exp (—2) dz 


—(N6i/T oR 7) 


: N/2T; 2 M/2T: 2 
S (20) i exp (—5) dz < (2n)? [ exp (-4) dz (9) 
—(N/2T;) = '—(M/2T3) 
(recall that N < M). 
Using equations (9) and (5) we see that 


(N=-1)% 


P{m(w)ef\ Ai} S DD P{mw)e D,} 


t=1 l=1 


< (M — 1)"@ny* | ne (-4) dz. (10) 


—(M/T;) 


IA 


Since the right side of equation (10) goes to zero as 7’; increases we can 
choose a 7's {7} large enough so that this term is less than 1/2. We now 
bound the second term on the right side of equation (8): 


PY m(u) 2U 4,} = ¥ Plmw)e Ay}. (1) 
Recall that 
=(U Fin = (Vln. 


Hence 


PY m(u) e U A }= 2 P{m(w) e () Fiz}. (12) 


i=No =No 


Observe that for all k # j 
P{mw)e (\ Fi} S P{mw) e Fix} (13) 
kei 


= PA at i) — gw, k) > a (14) 
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mi 1 
= Va, caren Oat exp (— ©) ae (15) 


Since D was covered by balls of radius 7, , there exists at least one inte- 
ger k < Ny such that || H — H; || < ro S ¢/4 and hence || Pr,(Hz; — 


Hyx;)||? < ris’; . Note also that since || H; — H || > 3¢/4forj = No, 
|| Pr,(Hx; — H;x;) ||? > rT; . Hence, 


— A; = || P7(Hx; — H;2;) ll2 re [| Pp (Hx; — Hyx;) I|2 
>rT, —rsT, = ( — lr, = 3/7, . (16) 
Recalling that D was bounded, 


rE = 4 || Py, (A;x; == Ayzx;) I|3 < AR?’S’T;, . (17) 
Using equations (16) and (17) in equation (15) we see that 


io) 


(24) "* exp (- z) dz. 


(18) 


Pim(u)e (\ Pin} S Pl mw) e Fis} | 


rTyt/16Rs 


Hence from equation (11) we see that 


PY m(u) £ U A ; <M / (2n)~* exp (— “) dz. (19) 
7=No 3r7;4/16Rs 

Thus we can select a 7 « {7} so that this term is less than af? 2. This 

T makes P{||H — H || > e} < 7 forall He 9. 

The identification technique proposed in the above proof | is not 
necessarily a practical technique. Our intent is to indicate the possibility 
of identification rather than to derive easily implementable techniques. 
Notice, however, that since the measurements are linear functionals on 
L,(0, 1) they are iterative in nature because of the integral representa- 
tion of such functionals. 

Theorem 1 gives sufficient conditions for determinability. Theorem 2 
indicates that some of these conditions are in fact necessary for identi- 
fication. 


IV. NECESSARY DETERMINABILITY CONDITIONS 


In this section we show that the approximability condition given by 
condition (z) of Theorem 1 is in fact necessary. We also show that a type 
of separation property is necessary, although it is not as strong as that 
given by condition (77) of Theorem 1. 
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Theorem 2: Let D be a bounded, determinable subset of 3C, then 


(t) the closure of D ts compact 
(21) given any 6 > O there exists an Xe L,(s), T > 0 and a positive 
number r(6) such that 


|| Pa(H& — Kx) ||} > r(6) forall H, Ke D 
satisfying || H — K || = 6. . 
Proof: (tz) Given e > 0, choose 7’, e L,.(s), N linear measurements and 
an estimator g(m(H, w)) so that? 


P{|| H — g(m(G, w)) || < «/2} > 2 forall He®. 


Since © is bounded and the measurements are linear, there exists a 
compact ball Bz 2 R” so that 


P{m(H, w) 2 Bé} < } forall He ®. 


Thus, since g is continuous, g(Bs) is compact. We can therefore cover 
By by a finite number of balls of radius ¢/2. If g(Bz) D D we could 
also cover © by the balls. We don’t have enough information to verify 
that g(Bz) D D. Notice however that 


P{w: || H — g(m(H, «)) || > €/21.0 (o: mH, «) 2 Bal} 
Plo: || H — g(mUH, )) || > €/2} + Plo: m(H, a) e Bo} 
P{{o: || A — gm, «)) || > «/2] U fo: mH, «) © Bal] 
Q+g-1=4. (20) 


We conclude that there exists an w, so that m(H, wo) « Bz and || H — 
g(m(H, w»)) || < ¢«/2. We can repeat this argument for each H e« 2. 
Therefore, © must lie within an ¢/2 neighborhood of g(Bz). By expand- 
ing the balls of radius e/2 which cover g(Bz) by a factor of two, the 
expanded balls will also cover D. Since this argument holds for any 
e > 0, D is shown to be totally bounded. Since 3C is complete, © is 
complete; and hence D is compact (see Ref. 6, p. 22). 


IV 


(iz) If D is determinable, then the closure of D, 9, is also determin- 
able. This is easily shown by noting that any channel in 9 can be 
approximated arbitrarily closely by a channel in ©. Hence the measure- 
ments will be arbitrarily close and because of the continuity of the 
estimate, the estimate will be close with high probability. 


+ Since the measurements are gaussian random variables we have included the 
dependence on the sample points w of the corresponding sample space 0. 
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Since 9 is determinable, for every « > 0 there exists an observation 
interval [0, 7'], a test signal £¢ L,,(s), and an estimator g(m(-, w)) so that 


P{|| H — glm, »)] || < 6/2} > 2 forall He OD. (21) 


Suppose that || Ps(H#é@ — Ké#) ||, = 0. Then, the measurements ob- 
tained will be the same irrespective of whether H or K were used and 
therefore the estimates for K and H will be identical. Since 


Plo: || H — glm(H, w)] || < 6/2} > 4 
and 
Plow: || K — glm(H, w)] || < 6/2} > 4, 
we see that 
P{{w: || K — glmH, #)] || < 8/2} 
O {w: || H — glmH, #)] || < 6/2)} 
= P{w: || K — glmH, )] || < 6/2} 
+ P{w: || H — g[m(H, w)] || < 6/2} 
— P{[w: || K — glm@, «)] || < 6/2} 
U fw: || H — glm, a)] || < 6/2)} 
Site-1=}. 
Thus there exists at least one sample point w) such that 
|| K — g@n(H, )) || < 6/2 
and 
|| Z — g(m(H, @)) || < 6/2 


which together imply that || H — K || < 6. If H, Ke D and || H — 
K || > 6 then || Pa(H@ — Ké) ||, > 0. 

Note that  X 9D is compact in the product topology and hence 
C(6) = {(7, K): || H — K || = 6, H, Ke D} is also compact. The func- 
tion f(H, K) = || P#(H# — K®) ||, is a continuous map of C(6) into the 
real line and hence it has a minimum value. This minimum value cannot 
be zero because we have already shown that f(H, K) > 0 for (H, K) « 
C(6). As a consequence, there exists a positive number 7r(6) such that 


|| P(H& — K&) |[2 > r(8) forall H,Ke® 
satisfying || H — K || = 6. 


3278 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1969 


V. LINEAR CONVOLUTION OPERATORS 


When we specialize the results of Theorems 1 and 2 to linear convolu- 
tion operators, it is possible to obtain the characterization of the deter- 
minable sets in terms of the kernels of these operators. These results 
are given in Corollaries 1, 2 and 3 below. We note that the resulting 
conditions are similar to those obtained by Root and Prosser for the 
deterministic identification problem.’ 


Corollary 1: If 3 is composed only of causal linear time invariant con- 
volution operators H, [Hx](t) = fj h(t — r)x(r) dr, he L,(0, ©) and if 


(i) © = {h|heL,(0, ©), He D} has a compact closure in L,(0, ~), 
and 

(tt) for each 6 > O there exists anxe L,(s), T > 0 such that || PyHx — 
PyKx ||. > 0 for allh, ke for which || h — k ||, = J3| h(t) — k(t) |dt= 
6 then D ts determinable. 


Necessary and sufficient conditions for © to have a compact closure 
are (see Ref. 6, pp. 298-299): 


(¢) D is a bounded subset of L,(0, ©), 
(i) lim,oo f3 | h(E + 7) — h(t) | dt = O uniformly for he D, and 
(dit) limp SP | h() | dt = 0 uniformly for he BD. 


Proof: We first show that if the closure of © is compact then the 
closure of D is compact in the respective topologies. Let || H ||*, H « x 
denote the usual operator norm, that is, 


A |/* = sup 1 Bele. 

zeLa(0,00) || & lle 
20 

Given any e > 0 there exists 7* > 0, x* « L,.(s) such that 


Py.Hae* la PalP ot [fo 
|| Prex* ||2 || Pra* ||2 


Note however that P;.2* ¢ L,(0, ©); hence || H || < e + || A ||* for 
arbitrary « > 0, so 


[|| Se+ Set (23) 


[| 7 || = || 2 ||*. (24) 
Using the linearity of H and Holder’s inequality we see that 


Wa it= sup Lele yy (Wb dete <a. (28) 


1 4p 
zeL,(0,0) || L |e > zeL,(0,00) || & {|2 
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Thus compactness in L,(0, ©) implies compactness in 3C and condition 
(z) of Theorem 1 is satisfied. 

Given 5 > 0, choose x,, T° so that condition (iz) of Corollary 1 is 
satisfied. We have already used the fact that || HP,yx, ||. S || h ||: 
|| Prox, |p. Hence HP;.x, is a continuous linear mapping (that is, 
mapping the kernels into time functions) from L,(0, ©) into £.(0, ©). 
Thus the image of D under this mapping has a compact closure. We can 
therefore choose a number 7 > 7” so that 


i (HP px, — KP x,t) di <4 forall H,KeS. (26) 
; | 


Define £ as follows: 


£(1) = xo(4) for 0< ts 7° 
= 0 for T°? <isTf 
=2(t—T1T) for P<tsT+T 
= 0 for T+ <i s ot 


=a(t—nl) for oP <t<n?4+ 7” 
= for nf+Tl?<t<sm4+vT 


(27) 


Note that ¢e L,,(s). Following the same line of reasoning as in the proof 
of condition (22) of Theorem 2 we can show that there exists an r(6) > 0 
so that || Pr.(Hx») — K2p) ||} > r(6) for all H, Ke D for which || h — k ||, 
> 6. We now proceed to show that 


|| P.ze(Hé — Ke) ||} > ¥(8)nT (28) 
where 7(9) = r(8)/4T. Let yo(t) = [HP pot) — KP poxp|(t) and y;(t) = 
Yyo(t — tT’). Then, by linearity and time invariance, 

if i? 
[ ,@e—Keyqa= fo QtnO+-- + yal at 
i-] (i-1)7 
(29) 


and 


G41) fF (441-7) TF 
| waa [wat tor ji. (30) 
iT ( 


t-7)T 
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Using these relationships we see that 


G41)? 
i (yo t +++ + y,)’ dt 


a 


(441) FP (i+1)F 

=f via —2f lui uelte + lyr Dae 
a iT 
G4? (i41)F 

=a vali —2 [- Wi +--+ oh ae| 
iT iT 
(i4+1)f (41) 

= iP Yi; ali - a Yo ar| = r(6)/2. (31) 
iT T 

Hence 


t of 
Poe —Ka)|2= f att [ota att 
0 T 


nf 
an Yo + ee + yn-1)” dt 
(n—1)T 
> nr(6)/4 = r'(dnT. (32) 


We see that this relation implies that condition (iz) of Theorem 1 is 
satisfied; thus D is determinable. 

When 3c is composed only of causal linear time invariant convolution 
operators we can also strengthen the conclusion of Theorem 2. This 
result is given in the following corollary. 


Corollary 2: If % is composed only of causal linear time invariant 
convolution operators and if D is a determinable subset of 5 then 


(1) gwen any 6 > O there exists an unbounded sequence T; , a sequence 
of inputsx;,2L,.(s) anda positive number r(8) such that || P-; (Hx; — Kx;)’ ||. 
> r(6)T; for all pairs H, Ke D for which || H — K || > 6. 


Proof: As a consequence of Theorem 2 we know that for any 6 > 0 
there exists an ¢¢ L,,(s), J’ > 0 and a positive number 7(6) such that 
|| Ps(H# — K#)’ ||. > (8) for all H, Ke D satisfying || H — K || > 6. 


Obviously, || HP#é ||. < || H || || Pet ||. . Hence HP sé is a continuous 
linear mapping from 3C into Z,(0, ©). Thus the image of D under this 
mapping has a compact closure. We can therefore choose a positive 
number 7 > T so that 


i (HPs4 — HP;8)*(t) dt <4 forall H,Ke®. (33) 
T 
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Proceeding as in the proof of corollary 1 we can easily establish (2) of 
Corollary 2. 


Corollary 3: If 3 is composed only of causal Hilbert-Schmidt operators H, 
[Hx](t) = f§ h(t, 7)x(r)dt, f% f% | h(t, r) |? dt dr < o, h(t, 7) = 0 for 
tT > tand zf 


() ® = {h|H « D} has compact closure in the Hilbert-Schmidt 
metric (\|h — k ||} = S$ f% | h(t, r) — k(t, 7) |? dt dr) 








(22)for each 6 > O there exists an unbounded sequence T; , a sequence 
of x, « L,,(8) and a positive constant r(5) so that || Pr,(Hx; — Kx;) ||, > 
r(6)T; for all h, k « D for which ||h — k |l2 > 6. 


Then D ts determinable. 
Proof: As in the proof of Corollary 1 we can show that || H || < || H ||* 


where 
| = sup Be 


xeL2(0,0) || x ||o 


From the Schwartz inequality we see that 


i Ul = x0) ee [ (f° h(t, 7)x(7) dr) dl 
[Lf trea tar [haem ear] ae 


[|e |lz Il 2 ile, (34) 
which implies that 


|| Ha ||2 


IIA 


IIA 


|Z || < [lAlle- (35) 


Hence, compactness of implies that © is compact and condition (7) 
and (22) of Theorem 1 are easily verified to hold. 


VI. COLORED NOISE 


Theorems 1 and 2 were derived for the case when z(t) the additive 
noise was a zero mean white stochastic process. The situation when 
Ez(t)z(r) = R(t, 7) can be handled in a similar fashion. The only addi- 
tional assumptions are: 


(1) R(t, 7) is positive definite; that is, 


/ i Ri, r)ww(7) dtdr > 0 forall we L,(0, 0) 
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satisfying J% | w(é) |? dt > 0, and either 
(it) RC, r) is Hilbert-Schmidt; that is, 


[ ff k@aPatar ae <0, o 
0 0 
(iii) if R(, 1) = Ro(t — 1) then 

[ [Ro Pat =< @. 


Inspecting the proof of Theorem 1, one sees that the whiteness assump- 
tion was only used in equations (8) and (17). If Ez()z(r) = R(t, 7), 
then equation (8) becomes 


on = Var [g(w, kb) ~ gv, 9)] 
<4 [0 [RG dees — Hadar. — Hed(o) adr. (86) 


Since H, and H, were chosen so that || P,,(H.v; — H;x;) || > 0, we see 
that since R(t, 7) is positive definite, o;; > 0. If we choose 6; to be less 
than min,;,, o;, instead of || Pr,(H.a; — H;2,) ||}, inequality (9) will 
remain true. 

Equation (17) is changed as follows. If Ez()z(r) = R(t, r), then by 
the Schwartz inequality 


I 


2. 
O;k 


a i I RE DE Ae OC = Haye arae 


IA 


Ti Ti 3 
4 j (H,t, — Hexd(o), [ | R(t, 7) ar} 
4 / ” (Hae, — Het)" a) dr 


< 44 f" a | R(t, 2) | dl ar} || Pe(Hie: — Hex.) [2 
< ACR’sT, . (37) 


On the other hand, if Hz(é)z(r) = Ro(t — 7), equation (17) is changed 
as follows. 


Ti Pi 
rk = 4 | / Rt = 7) (Hx; = Ayx,)()(A,2; = Hyx;)(7) dt dr 
a 0 


IIA 


1 
2 


4 [tia — Heed(0 | | [" iRe- oF ar | 
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Ti 3 
| (Hx; — Hex,)"(7) ar | dt 


A 


= 4C,Rs(T,)? [ | (A 2; aad H;x;)(b) | dt 


acaRs(t( . ut) | i ” je, — Hi) a 


4C,Rs(T;)*-(T,)? -Rs(T)? = CRs T? . (38) 


From equation (37) we see that the limit of integration in equation (19) 
now becomes 3r7?/4RsC*. If we use equation (38), this limit becomes 
3rT!/16RsC} . In either case this limit diverges as 7 increases. Thus 
Theorem 1 is still correct if the noise is colored. One can also see that 
Theorem 2 is true without any modifications. The whiteness assumption 
does enter into the proof in any substantial manner. 


IIA 


IIA 


VII. CONCLUSIONS 


In this paper we have attempted to formalize the notion of identifica- 
tion and examined conditions under which the a priori information would 
guarantee that the problem of identification was well formulated. Our 
purpose has been to indicate when identification was possible and not 
to specify a given identification procedure. It is hoped that the condi- 
tions derived here may motivate researchers to consider larger classes of 
identification problems than have hitherto been examined and also to 
indicate for what classes of problems identification is not possible. 
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APPENDIX 


Proof that the Space 3 Is Complete 


In this appendix we show that the space 3C with the metric induced 
by its norm is a complete space. If {H,,} is a Cauchy sentence in 3, we 
show that there exists an element A ¢ 3¢ such that lim ,....|| 1 — H, || = 0. 

Let {H,} be a Cauchy sequence in 3. Then given any « > 0 there 
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exists a number NV (e) such that if m,n > N(e), || H, — Hn || < ¢«. From 
the definition of the metric, 


PA = Pee 2 
|| Pra ||» 


for all T > 0, xe L.(s), || Prv ||2 4 0. Using the definition of L..(s), 
s(T)* > e|| Pra ||2 > || Pr(H.z — Ht) |le (40) 


for all n,m > N(e), T > 0, xe L.(s), || Pre ||, # 0. Thus, for each 
T>0,2¢eL,(s), || Pra ||. 4 0, {H,2} is a sequence of functions in Lp, 
and for each T > 0, P;H,x is a Cauchy sequence in L,[0, 7]. Hence, 
for each T there exists at least one time function yz « L., such that 
Pryr ¢ (0, ©) and lim,.,, || PrH,x — Pry ||, = 0. Furthermore, 
Yr 1S uniquely (except for a set of measure zero) specified over [0, 7’. 
Because of this uniqueness, if T; < T,, then Pr,yr, = Pr,yr, . Hence 
there exists a unique function 7 e L,, such that P;7 = Pry, for each 
T > 0. This function can be constructed: 


gt) =y,0) for OSt<1 


y(t) for 1s it<2 


lst<n 


yt) for n 


(41) 


For each xe L,,(s), « ~ 0 we have uniquely specified a function 7 ¢ Le, . 
For x = 0 we arbitrarily put 7 = 0. Call the operator defined by this 
association H; that is, Hx = %. We now show that lim,,.... || 1 — H, || = 0. 

For each T > 0, xe L.(s), || Prx ||2 # 0 we can use the triangle in- 
equality to show that 


| P-(Hz — H,2) ||. ||Prie — P,H,« ||. , || Pr. — 0.2) ||. 
< ! fe 


|| Pra ||2 7 || Px ||» || Pox ||> 





(42) 


If H,, H, are members of the Cauchy sequence, from our previous 
development we know that there exists a number N(¢/2) independent 
of x and T such that 


|| Pre ||» 





<¢/2 for m,n > N(e/2). (43) 
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Since lim,,.. || Pr(@z — H,,x) ||, = 0 we can find another number 
N*(e/2, x, T) > N(e/2) such that 


esi ia <¢/2 for m> N*(c/2,2,T). (44) 


Hence for all T > 0, Prx # 0 


ak as <e for n> M(e/2), (45) 


and if H were causal it follows that Ae X with lim || — Z, || = 0. 
The causality of H is easily established. For each xe L,.(s), T > 0: 


|| P,He — PpHP;e ||, 
|| PrpHx — PpH,z \|2 + || ProPst — PH, |\2 (46) 


For n sufficiently large each term on the right side may be arbitrarily 
small, hence || PpHx — PrHP rx ||. = 0 for all ae L,(s), T > 0. 

If % is composed only of linear operators the completeness proof 
follows as above except to additionally observe that 7 is linear. 





IA 
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The Theory of Cylindrical 


Magnetic Domains 


By A. A. THIELE 


(Manuscript received December 26, 1968) 


The theory of cylindrical magnetic domains provides conditions governing 
the size and stability of circular cylindrical magnetic domains in plates of 
uniaxial magnetic materials together with an estimate of the range of 
applicability of these conditions. The results of the theory are directly 
applicable to the design of cylindrical domain devices. Computation to first 
and second order of the energy variation resulting from general small 
deviation in the domain shape from an initially circular shape yields the 
conditions governing domain size and stability. The physical origin of 
the various terms in the energy expansion is examined in detail. A graph 
from which many domain size and stability properties may be obtained 
summarizes the results of the energy variation calculation. The minimum 
theoretically attainable domain diameter is approximately o,/7M? , where 
dy 18 the wall energy density and M, 1s the saturation magnetization. 
For domains to exist, the effective anisotropy field must be greater than 
4nM, . 


I. INTRODUCTION 


The recent development of a technique for the propagation of isolated 
magnetic domains in an arbitrary direction in anisotropic ferromagnetic 
thin films by P. C. Michaelis created a renewed interest in the use of 
domain propagation for device purposes." The technique used by 
Michaelis for propagating domains along the easy axis is quite different 
from that used for propagation along the hard axis. During discussions 
on the possible application of these techniques, A. H. Bobeck, U. F. 
Gianola, R. C. Sherwood, and W. Shockley suggested that for general 
symmetrical domain propagation the direction of magnetization must 
lie normal to the plane of the film’. The recognition that rare earth 
orthoferrites have the required properties came in response to this 
suggestion.” Experimental work on the application of this type of 
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domain motion device was then begun. Although at the present time 
this work has been largely concentrated on the orthoferrites, there exist 
other materials, such as the hexagonal ferrites and manganese bismuth, 
having the required properties. 

The present work directs attention to structures in which the prop- 
erties of the material used require the magnetization to lie normal to 
the surface of the plate. The modes of operation of devices constructed 
from such structures are classified according to the effect of wall motion 
coercivity. In the case of very high wall motion coercivity, the applica- 
tion of shaped applied fields determines the initial domain configuration 
which is then maintained by coercivity. For very low wall coercivity, 
on the other hand, the saturation magnetization, wall energy, plate 
thickness and bias field determine the domain size and shape. Between 
these two extremes, there is a continuum of intermediate modes. In 
either extremal mode, a complete set of operations (logic, memory, and 
transmission) may be performed.* The present work concerns only the 
low coercivity mode and specifically, right circular cylindrical domains 
in plates of uniform thickness and small variations therefrom. When 
observed by means of the Faraday effect, cylindrical domains have the 
appearance (particularly when in motion) of bubbles and therefore are 
colloquially referred to as “‘bubbles’’. 

The present work largely treats the theory of cylindrical domains with 
experiments and applications being considered only briefly. Section IT 
presents the domain model and mode of description. Section III contains 
the calculation of the energy derivatives used in the investigation of 
domain size and stability. Section IV contains an interpertation of the 
energy derivatives in terms of fields and potentials. Section V discusses 
the solution of the domain size and stability equations. Section VI 
discusses the range of validity of the domain model used in the previous 
sections. It is found that several assumptions implicit in the model are 
related, and a requirement on materials suitable for the production of 
circular domains is obtained. Appendix A contains a derivation the 
properties of certain elliptic integrals appearing in the theory of circular 
domains, Appendix B is a listing of the standard forms and series 
expansions of the magnetostatic force and stability functions, and Ap- 
pendix C is a list of mathematical symbols. 


II. THE DOMAIN MODEL AND MODE OF DESCRIPTION 


Figure 1 shows the magnetic domain structure to be considered 
here.” The isolated magnetic domain is magnetized downward while 
the remainder of the plate is saturated upward. The domain will be 
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Fig. 1— Magnetic domain configuration. 


considered to be near circular. Examination of the variation of domain 
energy under a variation of domain shape from the assumed unper- 
turbed shape yields domain stability. Once created, a cylindrical domain 
continues to exist if the magnetic configuration meets the conditions 
for stable equilibrium. The stability of a given configuration, however, 
does not guarantee that it can be produced. The generation of cylin- 
drical domains is a separate problem which is not treated here. 


2.1 Description of the Domain 


A cylindrical (7, 6, z) coordinate system is placed at the center of the 
domain with its z-axis perpendicular to the plane of the plate. The 
plate is taken to have planar surfaces and a uniform thickness h. Only 
the case of a plate of infinite extent, r; = ©, is considered here. It is 
assumed that the material constraints allow the magnetization to lie 
only along the z-axis and the magnitude of the magnetization is inde- 
pendent of the local magnetic field. The boundary between the two 
regions of magnetization, the domain wall, is assumed to be independ- 
ent of z (no wall bulging) and to have a width which is negligible in 
comparison to the domain radius. It is assumed that a wall energy 
density per unit area c,, may be assigned independently of either the 
orientation or curvature of the wall. The assumptions about the de- 
tailed magnetic configuration (the magnetization magnitude and orien- 
tation and the wall energy and shape) are coupled by the material 
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properties. Section VI contains a detailed discussion of the validity of 
these assumptions and the cylindrical wall assumption. Even though 
the foregoing assumptions appear quite drastic and restrictive, experi- 
mentally there does exist a region in which the results obtained under 
these assumptions are both accurate and useful. 

The expansion 


7,(0) = yr, cos [n(@ — 8,)] (1) 


of 7,(@) in terms of the Fourier coefficients, 7, and 6, , describes the 
domain shape in the plane. The 7 value is called the “rotational per- 
lodicity.”’ The condition 


|r 12> Dom [re | 2) 


assures that the domain is near circular and that the function 7,(6) 
is single valued and smooth. 

It is convenient to introduce the finite variations of the r, and 6, , 
Ar, and Aé,, respectively, in order to describe small variations in 
domain size and shape from the strictly circular domain of radius 
1 [75(0) = 70]. In terms of these variations, a small variation of the wall 
shape from r,(@) = ry) may be written as 


r(8) = 17> + Aro + Dd) Ar, cos [n(@ — 0, — Abn)] (3a) 
n=1 
where, by assumption, 
lro | >> | Aro | + Don | Ar, |. (3b) 
n=1 


Subject to the restrictions stated, equation (8) describes an arbitrary 
variation because of the completeness of the Fourier expansion. 

The externally applied magnetic field, H, is taken to be spatially 
uniform and to lie in the positive z direction. (The presence of a compo- 
nent of the applied field in the plane of the plate has no effect to the 
approximation that the magnetization lies only along the z-axis.) 

The assumed simple forms of the applied field and magnetic con- 
figurations permit the use of simple formal expressions for these quan- 
tities. The expression for the externally applied field is 


H = Hi, (4) 


where H is a constant and i, is the unit vector in the z-direction. The 
magnetization may be written in terms of the unit step function, 
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[> x <0, 
u(x) = \4, x = 0, (5) 
i x SD, 
as 
M = i,M,{1 — 2ulr,(0) — r]}u(z + sh)u(—2 + 3h). (6) 


2.2 The Energy Variation 


The investigation of domain size and stability proceeds by computing 
the first and second variations of the total system energy with respect 
to the r, and 6, . The total energy of the domain is 


Ep = Hy + Hx t+ Eu, (7) 


where Ey is the total wall energy, /, is the interaction energy with the 
externally applied field, and F y, is the internal magnetostatic energy. 
The total wall energy, under the previously stated assumptions, is the 
product of the wall energy density o,, and the wall area a: 


— i ple Sie [ {80 a3 Ea 3 do. (8) 


The interaction energy of the magnetization with the externally applied 
field is 


oo 2a ce) 
k,=-[MHav=-[ [ [ Murad, © 
Vv =o “60 0 
and the internal magnetostatic energy is 


fe ai ine ene va dV 


OM, 0M: 


ie) 27 ie) ee) 2r ee) / 
5 / i / / / ; 5ORs OE ar tel g dz dr’ dé’ dr’ 
+ 4-~ 40 0 -0o 40 0 Ss 


(10a) 





where 
s =r +r” — 2rr’ cos (0 — 0) + (2 — 2’)’. (10b) 


In expressions (9) and (10), V indicates volume and primes indicate 
quantities in the second coordinate system used in describing the in- 
ternal magnetostatic interaction. 
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The variation in the total energy when the r, and 6, are varied is 
= ol OL, 
a = $[(2) ax + (28) an 
n=0 n/’0 


1 00 0 Ee) ( OE » eral 
v5 Eps | (282 Dr) natn TANG Gg] ATA Om 


n=0 m=0 











aE, ) | 
ig (52 a6, POL On + 0; (11) 
where the subscript O refers to evaluation of the partial derivatives at 
the circular domain state, 7,(0) = 7), and O; refers to terms of order 
three and higher in the combination of Ar, and A6@, . The first partial 
derivatives of the energy, (0 ;/dr,)) and (0 7/06,)> , are the generalized 
forces of the system, while the second derivatives of the total energy 
form the elements of the stiffness matrix. 

Knowledge of the generalized forces and the stiffness matrix com- 
pletely characterizes domain size and stability. It is shown in Section 
III that only the energy derivatives, (@F 7/dro)) and the (0° 7/dr2)o, 
are non-zero when 7,(6) = r,. The equation obtained by setting the 
only nonzero generalized force equal to zero is called the ‘force equa- 
tion.” The expansion (1) is a quasi-normal mode expansion since cir- 
cular domains are completely metastable with respect to the @, and the 
stiffness matrix is diagonal with respect to the r, . 


III. CALCULATION OF THE ENERGY DERIVATIVES 


3.1 Derivatives of the Wall Energy 


The derivatives of the total wall energy are computed by substituting 
the wall shape expression (1) into the wall energy expression (8), 
noting that 

Ory _ 


ry a De, sin [n(@ — 6,)] (12) 


and differentiating under the integral sign. There results 


oe = he, [ {r cos [n(@ — @,)] — wren sin [n(@ — a} 


aE 4 (22) dé (13a) 





and 
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aE = ho, [ ; ({¢c [In(@ — 6,)} cos [m(@ — 8,,)] 


+ nm sin [n(@ — 6,)] sin [m(@ — ai}l = (%)" |" 
— {rn cos [n(@ — 0,)] — on sin [n(o — ani} 
Ory 





dr, cos [m(@ — 6,,)| — aa m sin [m(é — iI} 


Ec + (#7) dé (13b) 
with analogous expressions for 
OE y/A6, , PE y/00, 00m, and OH y/drp Om - 
Evaluating equations (13) for a circular domain, 
r,(0) = 7, and [dr,(6)/de] = 0, 


the circular domain derivatives are 


Bw) _ 

(22 . = Qrho,, (14a) 
oF») ane > 

( or. 5 e ro ho,.n ) n= 1 (14b) 


and all of the first and second derivatives of the total wall energy not 
explicitly stated are zero. 
3.2 Derwvatives of the Applied Field Interaction Energy 


The applied field interaction energy is evaluated by substituting the 
formal expressions for the applied field (4) and the magnetic configura- 
tion (6) into the applied field interaction expression (9), changing the 
order of integration, and integrating. 


—M.H if i. a {1 — 2ufr.(@) — r]} 


X u(z + sh)u(—z + 5h)r dz dr dé (15a) 


Ln 


2a 
nal | rae | — constant. (15b) 
0 


The infinite constant is independent of the 7, and @, and does not 
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contribute to the derivatives. Differentiating yields 





Oy ae 
ii 2QhM HH i r, cos [n(0 — 8@,)|dé@ (16a) 
and 
OE 27 
sO 2 Sar TT / cos [n(8 — @,)] cos [m(@ — 0,)]d8 —(16b) 
Or, Om 0 


with analogous expressions for 
OE /98, ; PE 7 /8%n 00m, And 87/0, Om - 
Evaluation of equation (16) for 7,(@) = 1 yields 


(22x) ae ee ere (172) 
OT /o 

2 
(2x) = 4nhMH, (17b) 
Orn /o 

2 
(“En) OFM ~y SO7 (17¢) 
Orn 0 


and all the other first and second derivatives of the applied field inter- 
action energy are zero. 


3.3 Derivatives of the Internal Magnetostatic Energy 


The formal expression for the internal magnetostatic energy is ob- 
tained by substituting the expression for the magnetic configuration 
(6) into expression (10). In dealing with the self-interaction energy, 
it is necessary to use two coordinate systems: an unprimed system and 
a primed system. Throughout the following calculation functions of 
the spatial coordinates (r, 6, and z) are written with primes whenever 
they are of the primed coordinates. Thus M, when considered as a 
function of the primed coordinates, is written M’. The subscripted r, 
and 6, are independent parameters and are never primed. 

The calculation begins with the evaluation of 0M ,/dz by differentiat- 
ing expression (6) and noting that 


d 

AG u(x) = 6(x) (18) 
where 6(x) is the Dirac delta function. Then 

OM, _ 4, 

a = M kg (19a) 
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where 

k[r, r,(@)] = 1 — 2u[r,(6) — 7] (19b) 
and 

g@) = i(z 4. n) — i(—z + ny, (19¢) 


After changing the order of integration, the expression for the internal 
magnetostatic energy becomes 


0 ie) 27 27 ioe) foe} , Iipol 
Ey=suif [ff oe meg ede! d0d0! dr de’. (20) 
0 0 0 0 —-27 4-0 


The factor g(z)g(z’)/s contains the z and 2’ dependence of this integral. 
From expression (19c) it can be seen that this factor consists of four 
terms. Application of the transformation (z, 2’) — (—z, —z’) to two of 
the terms under the integral sign combines these four terms into two 
terms. Making the transformation (z, 2’) — (z, z), where 


z=e-?, (21) 


on the remaining terms and carrying out the integration over z yields 
the expression for the internal magnetostatic energy in terms of an 
integral over surface magnetic charges. This expression is 


co) co) 20 27 Prana? 
Ey = Mz / / - BN id de! deae (22) 
0 Yo do 0 $ 
where Z is an operator defined by 


z= f delae) — a — WIL I (23) 


si = 7? + r? — Qrr' cos (6 — 6’) + 2’. (24) 


- The factor kk’ contains the r, and 6, dependence of the integral so 
that the derivatives of Ly, may be calculated by replacing this factor 
by its derivatives under the integral. Evaluating the first derivatives 
yields 


Okk’ _ , Ok’ or,(0’) , Ok _dr,(8) 
BB) oor: 











Orn : ar,(0’) ar, 


ok’ ! , Ok 5 j 
k ca cos [n(6’ — 9,)] + k ar, cos In(6 6.) } (25a) 
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0k ar,(6) 


Okk’ _ ak’ dr,(6’) 
ar,(0) 00, 


00,  or,(0’) 90, 














ek’ 
, 
= —k EL sin [n(6’ — 6,)] — k’ Ie sin [n(@ — 6,)]. (25b) 
Or, Or, 


Substituting these derivatives into the integral and exchanging the 
primed and unprimed r and 6. The first term becomes identical to the 
second. The derivatives of the internal magnetic interaction energy are 
then 


iu = oniz ff ie KS 


-cos [n(@ — 6,)jr’r dé’ dr’ dé dr (26a) 
OE = F i: [ a i’ 1 
Or, Om 2M. Z s 


dv a cos [n(6 — 6,)] cos [m(@ — 6,,)] + — dk’ dk 





Or, or, 
-cos [n(@ — 6,)] cos [m(@’ — 0p dé’ dr’ dé dr (26b) 


with analogous expressions for 
OF ,/00,, O° Ey,/dr,00,, and d°Ly,/80, 06n 


For circular domains (r, = 1) the factors k, k’, ak/dr, , and dk’ /dr, 
are independent of @ and 6’ so that the integrands are periodic in 6’ with 
periodicity 27. The range of integration of 6’ may therefore be changed 
from [0, 27] to [6, 27 + 6] so that after making the transformation 
(0, 6’) — (6, ¢) where 


c =. = 6, (27) 
the range of integration of both 6 and ¢ is again [0, 27]. Note that now 
garter? — 2’ cosh +2 (28) 


depends only on ¢. 

Using trigonometric identities, the integrands of the integrals for 
the various derivatives are written as a sum of terms each of which is 
the product of a factor depending only on 6 and a factor depending on 
¢. Carrying out the integration over 6 yields, for 7,(6) = 10, 


(2) = 4nM?z [ " / i k’ Oh, dé dr’ dr, (29a) 
0 0 40 Yo § or, 


Or 
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ak ok’ ok), 
ArM? Z a [ [ (ne ort ‘an, ar, ’r dé dr’ dr, (29b) 


o |e 
oS 
yy 
Mn” 
oO 
I 


ox) = ne des a a fo 1 
are : = 9 4r M,Z ; ap 3 
, OK | ak! dk ) 
(wy & 2 + ar, ar, cos n¢ }r’r dt dr’ dr, n> 0, (29c) 


while all the remaining first and second derivatives are zero. Note that 
by inspection of these integrals and the definitions of k, k’, and r, that 


oO 
Ex) 5 50. (2x) 
(ee ar, Jo ~~ Aro \ Oro /o (302) 
and 


(“2x) a 1 2 (ax) ay Miz [ [ °" ak’ dk (1 — cos ng) 
Or, | 20% \ OT /o 2 ae 0 Jo Jo OF, OY, 8 


-r’'r dg dr’ ar, n> 0. (30b) 
Noting that from expressions (18) and (19b) 


ok 

ance —2é6(r — r,) (31) 
and using the definition of the Z operator given in expression (24), 
expression (29a) may be integrated with respect to r and z, and the 
second term of expression (80b) may be integrated with respect to r, 7’, 
and z. The result after some rearrangement is 


(Bx) = —(2mh?) (44M?)F(2ro/h), (32a) 
(2B) = —(4mh) (40?) NIE (32b) 
(8s) — — ayy A 


i (n(anar’) 22 | (2) 2 1.0) | (32¢) 


where renaming 7” to 7 and using expression (19b) 


p(™) = 2 = 2 [2B(r >To» h) = 2B(ro To; 0) 


— Bro ge 9 h) i Bro py 0)], (33a) 
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Bey (#32) = i, ; I "(2 4 2) le dr dg, (33b) 


p =r+r — 2zor cos f, (38¢) 


and where 
Ll(e/2re)*] =f” ((¢/2re)® + 31 — cos AIHA — cos ng) dt. (84) 


The L, functions are reduced to standard elliptic integral form, and 
power series expansions are obtained for both large and small values of 
the argument in Appendix A. The B function is integrated once after 
displacing the origin of the cylindrical coordinate system from 0 to 
0’ as is shown in Fig. 2. The transformation connecting the (r, ¢) and 
(p, v) coordinate systems is 


psing =rsin ¢ (35a) 


pcosg =rcost—T%. (35b) 


Tv r=. f¢ ad d 
i / Ga Tm <1; (36a) 


Bro » Ts , Z) = 
Tv Torry _pdp dy _ 
If I (p? + 7)? ro = Tye (36b) 


Equations (86a) and (86b) are integrated to obtain, in either case, 


After the transformation 


ven 2 2\4 Fe 
Brest. =f G+ 2hde—f  izlae G7 


where p, is the value of p along the boundary r = r,. For r; = 1, pp = 
—2r) cos ¢ so that , 





Fig. 2— The (¢, r) and (p, ») coordinate systems. 
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BOA tee = i [h? + (2r)? cos’ v}? dy — 5 | h | (38a) 
and - 
Bes to OY Bt (88b) 
The remaining two terms of equation (33a) must be evaluated as a limit 
lim [B(ry , 7, , 0) — Biro , v7; , h)] 


rrrn 


lim [= ((o8 + 8)! — pid dy + = [| 
0 


rf7rn 
=a{h| 
since p, approaches infinity when 7; approaches infinity. Combining 


these results yields 


F(2ro/h) = : erly [" [(h/2r0)” + sin” gl dp — i} (40) 
and 


ay = (4/2n9){ 2F o/h) ~ = I  [(h/2re)? + sin? ol! leh (41) 


(39) 


Appendix B lists the standard elliptic integral form of the force 
function F and power series expansions for large and small values of 
the argument. In Fig. 3 the force function is plotted as a function of the 
domain diameter measured in units of the plate thickness 


d/h = 2ry/h. (42) 


The stability functions S, , also shown on this plot, are defined in Sec- 
tion 5.1. 


IV. THE ENERGY VARIATION—ORIGIN OF TERMS 


Summing the results of the last section according to expression (7), 
the total energy variation expression (11) is 


AE = [2rho,, + 40rhM.H — (2rh’)(4rM%)F(2r./h)] Aro 


1 an » OF (2r/h) é 
5 | srha.H (Anh) (An) “Son Thy |am 
OF (2r,/h) 


1 : Ls 2 y a 2 
a5 du ‘2 hon + 2ehM.H — (2rh)(40rM?) 3(Ora/h) 


2ro 


+ (h) 4a.) = (La((h/2r0)") — ONC + O3 (43) 
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Fig. 3—The magnetostatic radial force function F and stability functions, 
So — Si, S: = 0, as functions of domain diameter to thickness ratio, d/h. 


where F is defined by expression (83) and plotted in Fig. 8, the Z, are 
defined by expression (84), and all terms not explicitly stated are equal 
to zero. The remainder of this section treats the physical origin of the 
terms in the energy variation expression (48). 


4.1 The Generalized Forces 

The coefficients of the linear variation terms are the negatives of the 
generalized forces. All forces except the 7, force are identically zero, 
which for a circular domain is a consequence of the rotational symmetry 
of the system. The first term in the coefficient of Ar, is the product of 
the wall energy density o,, and the rate of change of wall area with re- 
spect to 7>, 27h. The second term is the product of the external field 
interaction energy density 2M,H and the rate of change of domain 
volume with respect to ro, 2rhr, . The third term is the rate of change 
of the internal magnetostatic energy with respect to r). 

The internal magnetostatic force may be identified in expression (43) 
and using expressions (32), (83), and (89) may be written in the form 


= (2B) = Qh?(4rM?)F (2ro/h) (44a) 


= (nro) QM a, 


ELE Cee LL} wo 
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where p’ is given in expression (33c). In expression (44b) the first factor 
in parentheses is the domain wall area, the second is the change in 
magnetization at the wall as the wall moves, and the quantity in braces 
has the form of an H field whose origin will now be interpreted by super- 
position of sources. The internally produced field arises from the super- 
position of the internal field of a plate uniformly magnetized normal 
to its surface with magnetization magnitude M/, and two disks of mag- 
netic charge of uniform magnetic surface charge density +2//, and 
radius 7°) . The first term within the braces is thus the demagnetizing field 
of the infinite plate of uniform magnetization. The second term is the 
difference in magnetostatic potential between a point on the edge of a 
disk of magnetic surface charge of uniform density 4M, and a point 
removed a distance h from this point in a direction normal to the plane 
of the charge disk divided by the distance h. This is just the z-averaged 
z-component of the field produced along the wall by the two charge 
disks since 


Hw = 5 f Hede =; [ — 24g, = O— 20) (45) 


where Q denotes the magnetostatic scalar potential and z is measured 
from the edge of the disk. Comparing expressions (44a) and (44b), the 
total internally produced z-averaged z-component of the magnetic field 
along the domain wall is 








(A at, )av i (47M) (h/ 21) F (2ro/h) (46) 
so that the total force per unit wall area (averaged over 2) is 
__1 (aE :) _ oe 
Qaroh (2 a 2M CH + (Hl ar,)e). (47) 


The first term is the product of the wall energy density and the wall 
curvature and always corresponds to an inward directed force. The 
second term is the change of magnetization at the moving domain wall 
times the z-averaged z-component of the total field at the wall. [The 
problem may initially be set up using this fact (Ref. 2, pp. 1922-1925). ] 
The properties of the force function will now be examined in some detail. 
From expressions (48) or (44a) the first order variation in internal 
magnetostatic energy, when 7 is varied, is 


AEy = —2(ah*)(4aM?2)F(2ro/h) At . (48) 


The plot of F in Fig. 3, and the expansions for large and small values 
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of the argument show that the force function is everywhere positive, 
is monotonic increasing, and has a negative second derivative. Since 
the force function is everywhere positive, the internal magnetic inter- 
action energy at all times acts in such a way as to expand the domain. 
Section 4.2 treats the effect of the slope and curvature properties of the 
force curve on domain size and stability. Substituting the expansion of 
the force function for small values of the argument (188d) into expres- 
sion (48) produces the energy variation for small values for o/h, 


AE = {2rhr.2M,(—40M,) + (44M 21675 
— (wh?) (40M )2[4(2ro/h)* — de(2ro/h)? + ++-]}Aro. — (49) 


(In the remainder of this section frequent reference will be made to the 
properties of F and the LZ, given in Appendices A and B.) The inter- 
action of the magnetization with the existing field from the infinite 
dipole sheet produces the first term in expression (49). This may be seen 
by comparison with expression (47) and observing that the field in- 
ternally generated in the infinite dipole sheet with no reversals is 
—4rM,. In Fig. 3 a dashed line through the origin with numerical 
slope one represents this term and forms the small r,/h asymptotic of F. 

The second term in expression (49) is the only thickness independent 
term in the expansion and therefore must be identical to the variation 
of self-energy of the two disks of magnetic charge which form the ends 
of the reversal when 7 is varied. Since the interaction with the infinite 
charge sheet and the self-energy of the disk have been taken into ac- 
count, the remaining terms are the mutual interaction of the magnetic 
charge disks. 

For large ro/h, an energy expansion in terms of h/r, is appropriate. 
Substituting the expansion of the force function for large values of the 
argument (138c) into expression (48) yields 


AE y = — WM XU cg #5(h/2r)” + 0] 


rg 


oop 


+[2 — 3(h/2r0)’ + 0,] In 








\ Aro: (50) 


This expansion obscures the identity of both the infinite sheet magnetic 
field term and the charge plate self-energy term so that a local (to the 
wall) magnetic energy lowering per unit line length description appears 
appropriate. However, the energy reduction per unit line length to 
lowest order in 27)/h is 
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If», (domain) — Fy (uniform magnetization) 





2r1y 


4 


2ro 
eh | (51) 





h? ; 
= —" (49M) In 
Tv 


so that the energy lowering per unit line length for the domain of in- 
finite diameter is infinite. [Equation (51) is obtained by integrating 
equation (50) to lowest order. The integration constant is determined 
to be zero by term by term integration of the expansions of F for large 
and small values of the argument and comparing at 2r,/h = 1.] The 
conclusion that the energy lowering per unit line length for an isolated 
straight line reversal may also be obtained by considering the energy 
lowering in a strip reversal when the strip width approaches infinity. 
The author’s intention at the outset of this entire calculation was to 
calculate the numerical value of this magnetic energy reduction per 
unit wall length. The internal magnetic interaction, however, retains 
just enough of its global character when the domain is very large so that 
no finite limiting value for this energy reduction exists. 
The internally generated magnetic field at the wall of the domain, 
for large 7,/h is obtained from expression (46). To lowest order it is 
1 27 
In | 4e h 


(52) 


4aM.) h 
(Hut.Jav = aie) dp 
T 


To 





which approaches zero as the diameter approaches infinity as it must, 
since for an infinite straight line magnetization reversal, symmetry 
requires that the z-component of the field be zero along the reversal. 


4.2 The Stiffness Matrix 


The second variation of the energy with respect to the Fourier coeffi- 
cients describing the domain determines the stability of the domain. 
Since the stiffness of the domain with respect to externally applied 
forces is proportional to the coefficient of the bilinear form which is the 
second variation of the energy, the matrix formed by these coefficients 
is called the stiffness matrix. The stiffness matrix is composed of three 
independent submatrices. The second derivatives of the energy with 
respect to the Fourier amplitudes form the radial stiffness matrix; 
the second derivatives of the energy with respect to the Fourier phases 
form the angular stiffness matrix; and the derivatives of the energy with 
respect to one Fourier amplitude and one Fourier phase form the mixed 
stiffness matrix. The derivative of the energy with respect to r, and rn 


3304 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1969 


are called the (n, m) radial stiffness matrix element, with similar nota- 
tion for the other submatrices. 

All derivatives not explicitly exhibited in expression (48) are zero. 
Thus, the angular stiffness matrix and the mixed stiffness matrix are 
zero and the radial stiffness matrix is diagonal so that the system is 
completely metastable with respect to angle and the amplitudes are 
normal modes of the system for small amplitudes. 

The (0, 0) radial stiffness matrix element is simply the derivative of 
the negative of the radial generalized force so that no further discussion 
of it is necessary. It should be noted that the derivative of the internal 
magnetostatic term with respect to wall position is not directly related 
to the radial field or potential at the wall since the derivative used in 
computing the radial field at the wall must be taken with the wall posi- 
tion held fixed. 


4.2.1 The Radial Stiffness Matrix Elements for n = 1 


The diagonal radial stiffness matrix elements, for n = 1, are the sum 
of four terms in expression (43). The first term, which always has a 
stabilizing effect, is the increase in total wall energy due to the lengthen- 
ing of the wall caused by the deviation from a strictly circular shape. 
Imposing a sinusoidal variation of amplitude As onto a straight line 
produces a relative increase in length of 


s+ As 
8 


The corresponding wavelength in expression (48) is 


=o QT 


Na (54) 
The wall energy term in expression (48), 
2 
Alin = o-2argh( A) ’ (55) 


is thus the product of the wall energy density, the wall area, and the 
variation in wall area per unit area. Notice that the relative variation 
in wall length or area is independent of the wall curvature, 1/ro, to 
lowest order in the amplitude of the variation. 

The second order change in volume of the domain interacting with 
the externally applied field produces the second term in the radial stiff- 
ness matrix elements while the rate of change of the internal mag- 
netostatic forces at the wall produces the third term. The sum of the 
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second and third terms is one-half the (0, 0) radial stiffness matrix 
element. This factor of one-half relates to the fact that a variation of 
Ar, , ” = 1 produces only one-half the mean square variation 7,(6) 
as is produced by an equal variation in r,. This shape-independent, 
second-order variation in energy arises from the variation in the gen- 
eralized forces, or fields at the wall, when the domain radius is varied. 
[See also the steps leading to expression (30b).] 


4.2.2 Translation Invariance 


The requirement of translation invariance in the infinite plate com- 
pletely determines the (1, 1) radial stiffness matrix element. Consider 
a cylindrical domain of radius 7) with a cylindrical coordinate system 
placed at its center. Under a displacement of the coordinate system of 
magnitude s in the 6 = 7 direction, the description of the boundary 
in the new coordinate system is 


2 2 
11(0) = ty — G+ 8 cos 0+ 7 ~ cos 20 + 0. (56a) 


Thus, to second order in s, term by term comparison with definition 
(3) yields 





1s’ 1s” 
Afro = =e. j Ar, =s, and Ar, = 4 (56b, c, d) 
The formal change in energy under this displacement (11) is 
= G2) +408 
AE = (2 ae. 5 + 2\ar/,° a Oars (57) 


Obtaining (0£/dro), and (0°E/dr?), from expression (43), and substitut- 
ing expressions (84), (85), (86), (100), and (138a) verifies that 


or) _1 (282) 
( Ori Jo 2% \OPo / 0 (58) 
The coefficient of s° in expression (57) is thus zero as required by transla- 


tion invariance, and further the (1, 1) stiffness matrix element is zero 
whenever the total radial generalized force is zero. 


4.2.3 The Magnetostatic Stiffness Terms 


The interpretation of the radial stiffness matrix elements for the 
higher n values is now considered. As in the case of the generalized 
forces, examination of the expansions for small 7, allows the self-inter- 
action energy of the two charge disks which make up the ends of the 
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domain to be separated from the mutual interaction of these charges. 
The variation in the internal magnetostatic energy due to a variation 
in some r, for a circular domain is in general from expression (43) 


AE im = teil’ - (rh) aon: Ee ro) af) 2 1a(0) | esr 


nm = 1. (59) 


Separating the h independent and h dependent terms of the power 
series representation in powers of 27)/h uniquely separates the above 
expression into two parts, one part representing the self-interaction of 
the charge disks and the other representing the mutual interaction of 
these disks. The h independent terms then represent the self-interaction 
forces of the charge disks and the h dependent terms represent the 
mutual interaction forces. In the expansion of L, for large (h/2ro)’, 
expression (129a), all terms of L,[(h/2ro)”] are h dependent. Using the 
large (h/2r,)” expansion of F, expression (138d), and the expressions 
for L,(0), (115) and (116), the thickness independent part of expression 
(59) is 





AE y,(Self) = ixM'r( 1 Yar, , n2zil. (60) 
This energy variation contains a term which results from the variation 
in the overall size of the disks of charge as well as the shape dependent 
terms. The size variation term will now be identified and subtracted 
out so that the shape dependent part of the self-interaction energy may 
be seen explicitly. From expression (49) and the discussion following it, 
the ratio of the variation in energy of two isolated disks to the variation 
in disk area is (47%) (16r,/27). The variation in disk area for a varia- 
tion in r, for n = 1 is (r/2)(Ar,)’ so that the change in self-energy of 
the two disks, other than that due to their mutual interaction or change 
in overall size, is 


AE un(Self-Shape) 
0, n=1 (61a) 


— nit)sr( 2, sa )tany, n>1.  (61b) 


It is not surprising that the variation of r, produces no shape related 
energy change, since from expression (56) this variation is to lowest 
order a displacement with a size change coming in second order. It is 
seen that the terms which remain after cancellation all come from L,(0). 
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In expression (59) the first term is independent of n and the 
—4rM?r,L,,(0)(Ar,)* term has been identified with the variation in the 
self-energy of the charge disks. The term 47M/?2r,L,[(h/2ro)"](Ar,)” must 
therefore contain all of the shape dependent part of the charge disk 
mutual interaction energy. This term also contains a contribution due to 
the variation in the total amount of charge and contribution due to the 
shape independent, general smearing out of the charge distribution. 
Since the second order change in the total amount of charge is inde- 
pendent of n for n = 1, these two contributions may be removed from 
the mutual interaction energy variation by replacing L, by L, — Le . 
The remaining mutual interaction energy variation is specifically due to 
the shape of the variation. This energy variation is 


AE an,(Mutual-Shape) 
h? h? 
= (44M), | zs] = Lal) |e n21 
0 0 
2Qn4+1 yn 
= (4rM; ) To ar. noi?) + M,., se?) Fecal, 
n2l (62) 


where the final form is obtained using the expansion for D, , equation 
(131), and the ,,,,, are the constants of the expansion. The interaction 
energy of planar multipoles of order n and higher has the form of equa- 
tion (62), as it must since the variation in the charge distribution for 
each n may be expressed in terms of such multipoles. 

The variation in internal magnetostatic energy due to a variation of 
r, , in the infinite sheet, for large 2r,/h, to lowest order in h/2rp , is 


h? 
Alun = ru)() 
ic 2n? In 


using equation (59), the large 27./h expansion of F (138c) and of 
L, (h?/4r3) — L,(0), (105), (116) and (125). 

The charge-disk self-interaction energy is not evident in this ex- 
pansion because it is exactly cancelled by the leading term of the mutual 
interaction energy. In contrast to the energy reduction per unit line 
length for a straight line reversal in an infinite sheet (which has no 


2ro 
ar h 





Ene 


n21 (63) 








3308 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1969 


finite value), it is possible in the case of this variation of the domain 
structure to compute the energy variation per unit line length. In terms 
of the wavelength of the variation \, defined in expressions (54) or 
(127) and in the limit of 7, — ©, the total variation in energy when r, 
is varied is [using expressions (43) and (55) and the limit (128)] 


AE _ {[ sea/h — (4nM?)x In | | a+ o6(* hare) (64) 


271 wh 

Comparison of expression (64) with expression (53) shows that the 
magnetostatic energy variation per unit line length for a circle of infinite 
diameter is the product of the magnetostatic energy density constant, 
the variation in line length, and the logarithm of a maximum effec- 
tive interaction distance, 4e\/z. (The maximum effective interaction 
distance for the magnetostatic energy lowering per unit line length is 
proportional to 7).) Hagedorn has computed the magnetostatic energy 
variation per unit line length for the case of a sinusoidal variation im- 
posed on an infinite straight line reversal.° The calculation was carried 
out by considering the energy variation produced by a sinusoidal applied 
to a strip domain pattern in the limit of infinite strip width. The result 
of this calculation is 


AE x,/(anit length) = —(4rM%)q In | d/(2.111h) | (h/d)*(Ar)’, (65) 


which differs from the result for the infinite circle by the constant inside 
the logarithm. 








4.3 Summary 


The physical origin of terms of the energy variation has thus been 
traced in the limiting cases of both large and small r,/h. In either of 
these limiting cases, it is thus possible to develop intuition with regard 
to the behavior of the domains. Since, as has been shown, the inter- 
pretation of the meaning of the energy terms in the limiting cases is 
qualitatively different, the development of intuition in the transition 
region is quite difficult. In many device applications this transition 
region is the preferred region of operation, making the use of analytical 
and numerical methods a necessity. 


V. THE SIZE AND STABILITY OF CYLINDRICAL DOMAINS 


The energy variation expansion (43) in principle contains all cylin- 
drical domain size and stability information. This section treats briefly 
the use of this expression in the determination of domain size and 
stability. The only non-zero generalized force in expression (43) is the 
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uniform radial force. When this force is set equal to zero (the force 
equation), the system is in equilibrium. Thus the condition that the 
system be in equilibrium provides, given a material and plate thick- 
ness, an equation relating domain size and the applied field. The loca- 
tion of the zeros in expression (43) (all terms not explicitly exhibited 
are zero) shows that the system is completely metastable with respect 
to angle and that the radial stiffness matrix is diagonal. The radial 
amplitudes are thus quasi-normal modes, and the study of stability 
reduces to the study of the stability of the individual radial amplitudes. 


5.1 Normal Form of the Energy Expansion 


Before proceeding with the discussion, it is appropriate to introduce 
some new notation and to rearrange the energy variation expansion 
into what will be called normal form. Since the stiffness matrix is of 
interest only when the domain is in equilibrium, the applied field H 
is eliminated from it using the force equation. The geometrical de- 
pendences of the various magnetostatic stability terms are then com- 
bined and normalized to the wall stiffness term by defining the “stability 
functions” as 


S.(d/h) = F@/h) — d “ F(d/h) (66a) 
and 


sal) = — +5 {suain) + & @ MLA) — LO}, 





nm = 2. (66b) 


The S, function is undefined or may be taken to be zero since transla- 
tion invariance in the infinite plate requires that the (1, 1) stiffness 
matrix element the identically zero whenever the generalized radial 
force is zero, aS is assumed to be the case here. The S, functions are 
plotted in Fig. 3 up to S,. ; they are given in standard elliptic integral 
form together with power series expansions for large and small values 
of the argument in Appendix B. The domain diameter, d = 2rp repre- 
sents domain size in this section. The normal form of the energy expan- 
sion is written as a function of the ratios of the three fundamental lengths 
of the system: the plate thickness h, the domain diameter d, and the 
“characteristic length’ defined by 


l= aie (67) 
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The characteristic length depends only on the type of material used. 
Dividing the energy variation expansion (43) by the normalizing 

energy 2(47M?)(rh*) and introducing the notation of the preceding 

paragraph, the normal form of the energy expansion results: 


wastes - (b+! H = #(2)| 
2(4r M2) (rh®) = Lh ' hh 4M, h h 


ACGME - se) 


+ Ya - »(*[ = s(2) (42) +05. (68) 


In expression (68) the coefficient — [l/h + (d/h)(H/4rM,) — F(d/h)] 
is the normalized radial force. Setting this force equal to zero yields 
the normalized force equation. The remaining bracketed quantities 
[l/h — S,(d/h)] are proportional to the diagonal elements of the stiff- 
ness matrix, and are called “stability coefficients.” For uniform radial 
variation, the stability coefficient has the opposite sign from the (0, 0) 
element of the radial stiffness matrix; thus this stability coefficient 
is negative whenever the domain is stable. For the other r, variations, 
on the other hand, the stability coefficient has the same sign as the 
corresponding element in the stiffness matrix, and these stability co- 
efficients are positive whenever the domain is stable. 











5.2 Graphical Solution of the Force Equation 


A graphical solution to the force equation 


Li dH ~ r(4) = 
h* h4eM, rt Pa ) 


may be obtained by constructing a straight line on Fig. 3 whose inter- 
cept with the vertical axis is //h and whose numerical slope is H/47M, . 
The intersections of this straight line with the F curve are then the solu- 
tions to the force equation. 

As was stated in Section 5.1, (z) the force function has a positive first 
derivative and negative second derivative for all nonzero values of its 
argument, (27) it is zero and has a first derivative of unity when its 
argument is zero, and (272) it becomes logarithmic for large values of 
its argument. From these properties and examination of Fig. 3, several 
properties of the solutions to the force equation may be appreciated. 
For negative values of the applied fields, there is only one solution to 
the force equation. Examination of the sign of the radial force which 
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results when the diameter is varied about the solution diameter while 
all other variables are held fixed shows that this solution 1s unstable. 
For small positive applied fields, there are two solutions to the force 
equation, the larger diameter solution being radially stable, the other 
radially unstable. However, a radially stable solution does not guaran- 
tee that the system is stable with respect to all possible deformations, 
and this must be investigated separately. As the applied field is increased, 
the two solutions move closer together until they coalesce. When the 
applied field is increased beyond this point, there are no solutions. 
Since the function F is asymptotic to a straight line through the origin 
having unit slope, the solutions will always vanish for a value of the 
applied field which is greater than 47/7, . Stable isolated cylindrical do- 
mains thus exist only in the presence of an applied field having magni- 
tude between zero and 47M, and polarity tending to collapse the 
domain. 


5.3 Graphical Determination of Domain Stability 


The stability coefficients are determined graphically by constructing 
a horizontal line at height 1/h on the force stability graph. Metasta- 
bility for each normal mode of deformation occurs at the intersection 
of this line with the corresponding stability function. Since the stability 
functions are monotonic, the diameter of metastability of each normal 
mode of deformation is uniquely defined and forms the boundary be- 
tween the regions of stability and unstability. The circular domain will 
be stable with respect to all variations when its diameter is greater than 
the radial metastability diameter and less than the metastability diam- 
eter for a variation with a rotational periodicity of two. The normal 
variations with rotational periodicity two are referred to as “elliptical’’ 
deformations. When the domain is stable with respect to elliptical 
deformation, it is necessarily stable with respect to the variations 
of higher spatial frequency since the stability functions of higher spatial 
frequency lie progressively (with respect to n) below the elliptical sta- 
bility function. The radial stability function S, and the elliptical stability 
function S, thus form the boundary of the region of total cylindrical 
domain stability. Therefore, given the magnetic material type and 
plate thickness, the range of stable domain diameters and the correspond- 
ing applied fields may be determined with the aid of these functions. 


5.4 Minimum Domain Diameter 


For any given value of 1/h, the minimum domain diameter is the 
collapse diameter determined by S, . The domain diameter measured in 
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units of the characteristic length is d/l = (d/h)/(I/h) which is the in- 
verse of the numerical slope of a line drawn in Tig. 3 from the origin 
to the operating point. The line of maximum slope, which both passes 
through the origin and contacts the S, curve at at least one point, thus 
determines the smallest domain diameter attainable in a given material. 
The coordinates of this contact point are d/h ~ 1.2 and 1/h & 0.3, so 
that the minimum attainable domain diameter is 


dmin & 4. (70) 


VI. RANGE OF VALIDITY OF THE MODEL AND THE QUALITY FACTOR 


At the present time, no quantitative evaluation of the range of valid- 
ity of the domain structure model used here has been carried out. The 
qualitative discussion given here, it is hoped, will provide the reader 
with an appreciation of the magnitude of the effects produced by the 
relaxation of the various constraints artificially imposed by the model 
and the dependence of these effects on the system parameters. It has 
been assumed that domain walls are cylindrical, have zero width, and 
have a definite energy per unit area which is independent of wall orienta- 
tion or curvature, and that the magnetization lies perpendicular to the 
surface of the plate. Section 6.1 treats the effect of the relaxation of 
the cylindrical wall approximation only. In Section 6.2, the other 
assumptions are all shown to be coupled using the simplest uniaxial 
material model. A single dimensionless material parameter g, which 
complements the characteristic length J in characterizing circular domain 
materials, is used to express the results obtained from the simplest 
material model. 


6.1 The Cylindrical Wall Approximation 


The discussion of the cylindrical wall approximation uses the coordi- 
nate system and domain configuration of Fig. 1 except that the walls 
are allowed to curve as shown in Fig. 4. The radius function, 7,(6, 2), 18 
determined by the requirement that it minimize the total energy. The 
Euler equation which results from this two dimensional field variational 
problem is an integro-differential equation similar to those which appear 
in Hartree self-consistent field calculations. No solution of this equation, 
numerical or otherwise, has been attempted or is contemplated at the 
present time. The Euler equation consists of terms arising from: the 
wall energy, the interaction of the magnetization with the applied field, 
the self-interaction of the magnetostatic charges at the surface of the 
plate, the self-interaction of the charges produced by the slope of the 


CYLINDRICAL MAGNETIC DOMAINS 3313 
aoe hb. be = ws 
1PF , 
me 
| 
H{ h 
| 
| 
| 
es 


h 
XN 
‘IS 


Fig. 4— Cross section of a noncylindrical, near-circular, domain. 


domain wall, and the mutual interaction of the plate surface charges 
with the domain wall magnetic charges. Boundary conditions (obtained 
from the appropriate transversality condition’) require that the wall 
surface be perpendicular to the plate surface at all intersection points 
(3 in Fig. 4). Physically (since in the model used here the crystal is 
assumed to be strain free) the surface cannot interact with the domain 
wall, and therefore the wall must intersect the surface at right angles. 

Although it is not clear that domains having a roughly conical shape 
are ruled out, it will be assumed that the domain has reflection symmetry 
through the central plane of the plate and that the radius is a function 
of z only, r,(z). In this case the wall must be vertical at the central 
plane as indicated at © in the figure so that the single parameter b 
represents the magnitude of the wall bulging. Since the Euler equation 
requires the curve to be smooth, there must be an inflection point, J, 
between @ and 3. The wall area, and thus total wall energy, is a quadratic 
increasing function of the wall curvature so that the concentration of the 
curvature at the center and ends of the wall, produced by the trans- 
versality and symmetry conditions, tends to reduce the wall bulging. 

The radial field at the domain wall from the charges at the surface 
of the plate is directed as shown in Fig. 4. The effect of the interaction 
of the magnetostatic charges due to the slope of the wall with the radial 
component of the field from the surface charges is destabilizing for 
either positive or negative bulging. This interaction produces a negative 
quadratic term in the total energy. However, at the plate surface, where 
the magnitude of the radial field is greatest, the transversality condition 
requires that the charge density produced by the wall slope is zero so 
that the magnitude of this negative term is small. The z component of 
the field from the charges on the surface of the plate determines the 
direction of bulging. (The applied field, being uniform by assumption, 
need not be considered.) Along an initially cylindrical wall the internal 
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field is everywhere directed, so as to make the domain expand, and 
attains its greatest magnitude at the center plane of the plate. It there- 
fore provides a linear term in the total energy which tends to bulge the 
wall in the positive direction as shown in Fig. 4. 

Thus, for near cylindrical walls, the bulging is determined by the 
interaction of this force (tending to bulge the wall) with the wall energy 
(acting to stabilize the wall) and the radial field (acting to destabilize 
the wall). The self-interaction of the wall charges enters only as a 
higher-order term. It should be noted that the transversality condition 
acts both to strengthen the stabilizing term and weaken the destabilizing 
term. 

The relevant dimensionless wall energy for the wall bulging problem 
is l/h = o,,/(h4rM?). Wall bulging is expected to decrease with increas- 
ing wall energy. A second independent effect related to I/h may be 
appreciated by inspection of the S, and S, curves in Fig. 3. It can be 
seen from Fig. 3, equation (68), and the discussion following it that, 
since the S, and S, curves bound the region in which stable circular 
domains exist, d/h must increase with increasing //h. By symmetry, 
the z-component of the internally generated magnetic field at a cylindri- 
cal wall is zero for a domain of infinite diameter and clearly increases 
monotonically as the domain diameter to thickness ratio decreases. 
Thus, as the plate is made thicker, the bulging force becomes stronger 
and the stabilizing force becomes weaker. Since several independent 
effects cooperate to increase bulging with increasing plate thickness, 
the onset may be quite rapid when it does occur. Domain collapse data 
taken at d/h 1is in good agreement with predictions made on the basis 
of equation (68) and Fig. 3.° This then provides some indication that 
the cylindrical wall approximation remains valid at this thickness. 


6.2 The Quality Factor 


The discussion of the approximations other than the cylindrical wall 
approximation uses a polar (M,, 7, v) coordinate system where 7 is the 
polar angle and » is the aximuthal angle to specify the orientation of 
M, (See Fig. 5). The polar axis is taken to be the z-axis of the preceding 
sections. The domain wall is taken to be planar with its position and 
orientation specified by a plane at its center. The axis through the origin 
in the direction of the wall normal is denoted by &. The position of the 
central wall plane is denoted by & . The orientation angles of the wall 
normal are denoted by »,, and 7,,, (see Fig. 5). 

In the simplest uniaxial material whose easy axis is the z-axis the 
magnetic energy density for a planar wall is (Ref. 9, pp. 189-192) 
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Fig. 5 — Coordinate system for specification of domain walls. 


0 . * 2 ‘ - 2 2 
pr = al (22) + sin o(%) | + K,sin° 7 —-H-M + 2 Ze (ME) 


(71) 


where A is the exchange energy density coefficient, K, is the anisotropy 
energy density coefficient, H is the externally applied field, and the last 
term is obtained by integrating V-B = 0. For a uniformly magnetized 
material in the absence of applied or internal fields, this expression 
reduces to pz = K, sin” 7 which has absolute minima at 7 = 0 and 
7 = w. The z-axis is thus the easy axis as is required for consistency 
with the preceding sections. 

The anisotropy energy density coefficient is sometimes expressed in 
terms of the effective anisotropy field H, = 2k,/M,. The quality factor 
is now defined as the dimensionless anisotropy energy coefficient or 
dimensionless anisotropy field 


— Ky a i, “ 
1 = OFM? 4nM, 








(72) 
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6.2.1 The Nucleation Field 


When a bias field H is applied in the positive z-direction and demag- 
netizing fields are neglected the energy density is 


px = K, sin? n — HM, cos 7 (73) 


which, for H < H,, has a local minimum at magnetization orientation 
n = mw and an absolute minimum for 7 = 0. When H > H., only the 
minimum at 7 = 0 remains. In a perfect crystal the effective anisotropy 
field is thus the field at which the magnetization becomes unstable 
with respect to reorientation (assuming it is initially oriented in the 
negative z-direction). If a reorienting field is applied locally (local but 
over a region whose dimensions are much greater than a wall width so 
that the effect of exchange forces can be neglected), then H, is the total 
local field required for the nucleation of a domain at that locality. If 
the nucleation field Hy is understood in this sense, then in a perfect 
crystal q is the nucleation field measured in units of 47M: 





Hy _ 
rT ag q. (74) 


In an imperfect crystal Hy/4rM, may be either larger or smaller than gq. 
If it is larger, the material may be expected to have a high wall motion 
coercivity. 


6.2.2 Susceptibility 


When a transverse bias field H,(y = 7/2) is applied and demagnetizing 
fields are neglected, the energy density becomes 


pz = K, sin’ n — A,M, sin 7 (75) 
which has stable magnetization orientations 
sin’ (#4) = sin’ ( d) , Hye <a; 
n= a (76) 
? A, 
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The transverse susceptibility is therefore 


(H, < H.,) (77) 


0 (7, 2 H.) 
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where M, = M, sin y is the component of the magnetization in the 
direction of H,. Thus, the susceptibility to tipping of the magnetization 
by a transverse field is inversely proportional to q. 


6.2.3 Wall Energy and Wall Width 


Consider now a planar Bloch wall, —-M = 0, between two regions 
whose magnetization at points far from the wall lies along the two easy 
directions, 7 = 0 and 7z, and again assume that there are no applied 
fields or fields produced by boundary surfaces. Under these conditions, 
the magnetic configuration is determined by minimization of the wall 
energy per unit surface area which in this case is (Ref. 9, pp. 189-192) 


— / | a(22) 4 K, sin? | dé. (78) 
Carrying out the minimization results in 
o, = 4(AK,)? (79) 
for 
1 
f= f= l,, log tan (2) (80) 
where 
Spay 
em (e) (81) 


is the wall width. The definition of wall width is somewhat arbitrary 
since the wall extends over all space. In this case, following page 191 of 
Ref. 9, it is chosen so that the magnetization would complete its entire 
rotation of m radians in a length 1, if the entire rotation took place at 
its maximum rate, the rate at the center of the wall. 

The ratio of the characteristic length, equation (67), to the wall 
width is 


q; (82a) 


so that the ratio of the minimum domain diameter, equation (70), to 
the wall width is 


Gait eS g. (82b) 
T 


The approximation of zero wall width thus improves as q becomes 
larger. 
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The approximation that the wall energy is independent of wall curva- 
ture is clearly related to the wall width. At large distances from the 
planar wall equation (80) becomes 


[1 — no] = 2exp(—m|é — & | /lu) (83) 


where np is the appropriate equilibrium orientation of the magnetization 
at a distance far removed from the wall. Such an exponential relation 
will hold for the approach to any stable equilibrium orientation in the 
presence of isotropic exchange. The change in energy of the wall due to 
overlapping of the tails of the wall as the wall is curved is clearly related 
to q, becoming larger as g becomes smaller. In order to solve for the 
dependence of the wall energy on curvature it is necessary to solve the 
entire (including magnetostatics) micromagnetics problems. 


6.2.4 Summary 


The preceding results may be summarized by noting that the higher 
the gq value, the more closely the simple uniaxial model obeys the con- 
straints of the domain model used in the previous sections. It is clear 
that, for domains of the type considered to exist at all, g must be greater 
than one. For device operation, g should probably have a value greater 
than two. 


VII. CONCLUSIONS 


The theory of cylindrical magnetic domains yields conditions which 
predict the size and stability of these domains and provides an estimate 
of the range of applicability of the model used. The results of theory 
appear to be accurate in a range useful in the construction of circular 
domain devices. 

The domains considered are isolated right circular cylinders in plates 
of uniaxial magnetic material of uniform thickness cut so that the plate 
normal is parallel to the easy axis. The first and second order energy 
variations which result from a general small deviation from the strictly 
circular shape determine domain size and stability. The energy method 
was chosen in preference to the magnetostatic field method because of the 
uniformity it provides in accounting for the forces in both the equi- 
librium and stability problems. The integrals arising from the energy 
method are interpreted physically in terms of fields and interacting 
charges. The physical interpretation of the integrals is quite different 
in the limiting cases of very large or very small domains. The integrals 
are related to special cases of the fields of uniformly charged disks 
computed by C. Snow and tabulated by N. B. Alexander and A. C. 
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Downing."’"’” The present work obtains the needed properties of the 
integrals (expansions, recursion relations, and others) directly from the 
definitions. 

When the energy variation is described in terms of a Fourier decom- 
position of the domain radius function, only the generalized force cor- 
responding to a change in domain size is non-zero and the stiffness 
matrix is completely metastable with respect to angle (phase) and 
diagonal with respect to the Fourier amplitudes. Since the Fourier 
amplitude stiffness matrix elements are all found to be distinct, the 
description is unique and may be described as a quasi-normal mode 
description. 

The normal mode description is summarized by a single graph from 
which many domain properties may be determined by construction. 
Cylindrical domains exist only in the presence of a bias field directed 
so as to tend to collapse the domains and having a magnitude between 
0 and 47M, . The uniform radial collapse of the domain and the run- 
out of the domain into an initially elliptical shape bound in the region 
of stability. The minimum attainable domain diameter in a given 
material is dnin = 41 occurring a plate thickness of ~ 41. It is estimated 
that the cylindrical wall approximation begins to become doubtful at a 
plate thickness greater than 4/. In order for cylindrical domains to 
exist, H, ~ 4M, and in general approximations such as the approxi- 
mation of zero wall width become more accurate for H, >417M,(dnin/ly = 
SH,/4n°M, where l,, is the wall width). 

It is interesting to note that since stable cylindrical domains of a 
definite size exist in the total absence of wall motion coercivity and may 
be freely moved, they form a relative, easily observable, classical model 
for illustrating several particle-field concepts. They may be considered 
a two-dimensional particle which is produced as a singularity of finite 
extent in an underlying three-dimensional field (the magnetization). 
Cylindrical domains are particularly useful for demonstrating the con- 
cept of identical particles since, while it is possible to put identifying 
marks on domain locations, it is not possible to mark individual do- 
mains. (Cylindrical domains do exist in two species which may be 
distinguished by the direction of rotation of the spins in the domain 
wall.* All attempts to observe this difference up to the present time 
have been unsuccessful.) 
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APPENDIX A 


Integrals of Cylindrical Domain Theory 


This appendix contains the reduction to standard form of the elliptic 
integrals which arise in the theory of cylindrical domains in plates of 
infinite extent and power series expansions of these integrals. All the 
properties of the functions obtained here are used in either the physical 
interpretation of the energy variation expansion or in generating the 
numerical values of the force and stability functions. 

It is convenient to define functions U and V which appear repeatedly 
in cylindrical domain theory. The elliptic integrals which appear in the 
final results of the theory appear only in the forms U and V, U being a 
function of only the complete elliptic integral of the second kind and V 
being only a function of the complete elliptic integral of the first kind. 
Because of the form of the U and V functions, it has proven easier to 
obtain the needed properties, (such as the series expansions) directly 
from the integral definitions rather than deducing them from the tabu- 
lated properties of elliptic integrals. 

The latter half of this appendix treats the properties of the L, func- 
tions. A recursion relation is obtained and used to reduce the L, to 
functions of U and V. Power series expansions of the L, are obtained 
directly from the definition (84). 


A.1 Definition of the U and V Functions 


The functions are defined in the alternate forms 


U(2) = ) / [x + 3(1 — cosa)]? da (84a) 


a/2 
= 2 [ (x + sin’ B)' dg (84b) 
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aw/2 ; 
25 i iat ay as (846) 
1 ] 
ee (5 : -| (84d) 
and 
Va= | eae eee ee (85a) 
w/2 ‘ 
~ 92 f (c + sin? g)"} dg (85) 
w/2 : 
= 2 / (a + 1 — sin’ y)™? dy (85c) 
1 1 
= 2(2 + 1) x4) (85d) 


where the dummy variables are related by 8 = a/2 andy = 7/2 — a/2 
and where K and EF are the complete elliptic integrals of the first and 
second kind respectively. The argument of the elliptic integrals is the 
parameter m of Abramowitz and Stegun."* The parameter m is equal 
to the parameter k” of Jahnke and Emde or Groebner and Hofreiter.'*'*® 


A.2 Differential Equations and the Power Series Expansion of U and V 
From the definitions (84) and (85) 


du _a 
cape 3V. (86) 
The differential equations obeyed by U and V are 
lot + 2) H+ F|u@ =0 (87a) 
x *) a + 4 |U@) = a 
and 
lotta S+e +n£44|r@ =0 (87b) 
£ %) ap a da + 4 |V@) = 9. 


The U differential equation is verified by substituting in the defining 
relation (84a) and then reducing the resulting equation to the indentity 


= sin @ 
ae ere ies) (880 
_ ff" xcosa+4cosa—i—t} cosa 
= [@+sd —esal  e- = P 
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The V equation is then easily obtained by differentiation of the U equa- 
tion. 

The roots of the indicial equations of these equations are separated 
by 1 [they are 0 and 1 in equation (87a) and —1 and 0 in equation 
(87b)] so that the series expansion of U is of the form 





U(x) = >) Ujzi where U,; = U} + U3 In 16 (89a, b) 
7=0 
and 


(90a, b) 





Vers Vek whee (Vee Vea | 
j=0 
The form of the logarithmic terms has been chosen with some foresight. 


Substitution of the expansions into the differential equations and 
comparing coefficients gives the recursion relations 





Ui, = _G = a) ur p41 (91a) 
pe EG ie re 

| “IGF D ea Gad) D ne ee 
re i+ 3 ay ” ; 

Vita -(3 +1 V; ’ J = 0, (92a) 





ane 
Vig | (i 4) ! |, >0. (92b 
7 Gel ta)" oa”) so =) 
The starting values of V/, Vi’, U} and U%’ are determined directly 
from the integral definitions of the functions (84) and (85) and the 
differential equation (86) relating U and V. This is quite straightfor- 
ward except for V which must be expanded 





w/2 
Va) =2 fie aa + 2 i LOB boas 5) (3a) 


sin at 


and evaluated as a limit 


lim V(x) = 


z—-0 


(93b) 





6 


The limiting value of V may also be obtained quite easily from equation 
(85d) and the tabulated properties of the complete elliptic integral of 
the first kind.** The expansions of U and V are thus 





= ue 2 3 3 665 4, 
u@ =(2+4 t+ a5 3° — Ga? + d4576° + ) 
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od jeg 8 ee 23> oy) 16 
+ (0 PS ge ae poge 9 na | @4) 
and 
1 21 185 
ran torte Aca? nan) 
25 )3 16 
one as Sa Seacen Wie pesbch 5 





A.3 Hxpansion of U and V in Terms of Inverse Powers 


Making a Taylor series expansion of equations (84b) and (85b) 
respectively and integrating yields the expansions of U and V in terms 
of the inverse powers of the argument 


1 ave 7 
U(2) = 22° i (1 + x7" sin’ B)? dB 








= = —2 (2j)! etd Bore 2i 

" RR-IS ? an 

a rh a 

7 0 (27 — Te 16)! [as : ooe) 
1 De oj 8 r Do -3 

welll ghet- gett ho eee] oot) 


and 


w/2 
V(e) = 207! [1 + 27 sin® py! a 


e 





= 32 TIE yo? i sin” 6 dB 


foe ee ee ee ee ee S| 
= rr hs 4% + ca 5g 2 + (97b) 
A.4 Definition of the L, Functions 


The ZL, functions are defined in expression (384) by 


T 


__( = cos na) da 
[x + 4(1 — cosa)]"’ 





L,(2) = x2d0,n20 (98a) 
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or with the change of variable 8 = a/2 


7 w/2 sin’ ng 
I,,(a) = 4 / (e+ sin? 5)! dg, z>0,n>0. (98b) 
It can be seen directly from equation (98b) that for a fixed value of n 
Fale < 0, L,(©) = 0, and L,(x) = 0. (99a, b, c) 
From definitions (84b), (85b), and (98b) 


L,(a) = 2[U(x) — «V(a)). (100) 

The higher ZL functions are determined by means of a recursion relation. 
A.5 The L, Recursion Relation 
The ZL, recursion relation is 


_i_ 


Ln+i (2) = aon + 1 


[4n(2x + 1)L,(x) — (Qn — 1)L,-1(@) — 8nxV(a)), 
n21. (101) 


The recursion relation is verified by substituting in the definitions of 
L,, and V, equations (98a) and (85a), and reducing the resulting equation 
to the identity 


o=/ é. tsin ne[e + $(1 — cos a)]#} de (102a) 
0 
f n cos naz + 4(1 — cos a)] + } sin na sin a 
0 


[a + #(1 — cos a)]? 


The initial functions Z,(x) and L,(x) are given by equations (99c) and 
(100). Note that for large values of x the recursion relation is unstable 
for Increasing n. 


da. (102b) 


A.6 Power Series Expansion of the L, Function 


The function L,(x) is identically zero, equation (99c). The series 
expansion for L(x), obtained from equations (100), (94), and (95), is 


a 18 2, 9 3 5255 4 i>] 
Ina) = (442 16” + 16” ~ {22887 + 


153, 175 4 
— 397 + 59% 


From the recursion relation (101) and the initial functions (99c) and 


| a 1,, | 16 
+ (0-26 +32 +o) Ean | - (103) 
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(100) the general form of L,,(x) is 
L(x) = u,(x)U(x) + v, (2) V(x) (104) 


where u,(x) and v,(x) are polynominals of order n or less in x. Because 
of the form of U(x) and V(x), expressions (89) and (90), an expansion 
of the form 


Ly) Lag" (105a) 
7=0 
where 


16 
= | (105b) 





L,,; = Dy,; + Ln; % In 


clearly exists. 

Expressions for either the coefficients in the polynominals u,(x) and 
v,(x) or the Z,,; may be determined in closed form by similar methods. 
It has, however, proven more useful to use the recursion relation di- 
rectly when the complete expression of the form of expression (104) 
is desired and the expansion (105) when a power series is desired. 

To obtain the Z,,,; the expansion (105) is substituted in the recursion 
relation (101) and coefficients of x are compared to obtain a hierarchy, 
in j, of recursion relations, each member of the hierarchy being factor- 
able and depending only on the preceding member. These recursion 
relations are then factored and successively summed. 

The coefficient of x’ is 


Lass = Fy {lls — On = Dinas + 8nlLasr — Veal, 
n=O (106a) 
where 
Da... = O and V_, = 0. (106b, c) 
With the definition 
Qi = (2n — 1)[Ln,5 — Ln-1,1] (107) 


the second order recursion relation (106) factors into two first order 
recursion relations 


Qn+1.i = On, i =P 8n(Ly, j-1 ao Ve24), n ea 0 (108a) 
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and 
1 


oneal Oissi-4 neil. (108b) 


i Seon = Ln. j + 


The recursion relations for 7 = 0 and 7 = 1 will now be summed. 
From expressions (99c) and (105a) 


Do,; = 0 (109) 
so that using expression (107) 
Q1,, = lh,;. (110) 


For 7 = O using expressions (106b) and (106c), the recursion relation 
(108b) becomes simply 


Qn+1,0 = Qno- (111) 
By inspection of expression (103) the initial value of Q,,.o is 
Qio = Li = 4 (112) 
so that from expression (111) 
Q0=4 nel. (113) 
The recursion relation (108b) thus becomes 
en are ae n=0 (114) 


which with the initial value of expression (109) may be summed to yield 








eed oe. 4 De Mesa (15a) 
4 ary — n>. (115b) 
From the form of the expansion (105) it can be seen that 
L,(0) = Ln.os (116) 
so that with (99a, b) 
ya Sete TOye0. WSs. aH 


123-17 


Tor evaluating the Q,,, and Z,,, sums, two relations are needed: 








n 


n 1 1 n n 1 
4 me 1 eI 
j= 


7=1 
= Seat t 
=] 2k ae 1 


oy 





ow 
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. 1 n 
cae | me 
Want D(a q)-§ ae 
and similarly 
ee ae ea ee. aig) 
j=1 k=1 2k — 1 8 k=1 2k — 1 8 , 


Evaluation of the sums of expressions (118) and (119) is analogous to 
integrating x” log « where n = 0 and 1. With sufficient patience the sums 
can clearly be carried out for any finite n. 

For j = 1, expression (108a) becomes [using expressions (95), (109), 
and (115)] 


. i 16 
= — reas = 
Quiia = Qaa + an 4 a4 mn | z= | n21 (120) 


where the initial function is [using expressions (103), (105), and (110)] 


16 
alt (121) 





Q.=h4,=1—-—Ih 





Summing [using expression (119)] yields 

















16 = : 1 16 
Qatia = 1 — In ae + Se] Sats — tn 5s | 
2 16 2 - 1 0 2 _ 
= —(2n+ 1)" In = | + 4(2n + 1) a aes ae (4n 1), 





n=l. (122) 


The Z,,, recursion relation is then 
16 - 1 
lea Dat | ons. 


n=1 (123) 





Dntia = In. + Qn + »| —in 


which may be summed using the initial value of expression (121) and 
the sums (118) and (119): 


= [1 + Sek +1) | 


x 





Davia = 1— In 


+ 4h +1) Yt p+ (H+ C24) 


k=1 
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0, n= 0. (125a) 
= Jaspal 48 |) ee eee ee 
| 2n'(£ In ; Qn? + (4n 1) UR iT? 


n2=1.  (125b) 


It is clear that the procedure leading to expressions (115) and (125) 
may be carried onward to lead to an expansion of the form 


L, (a) — L,(0) = + ys | > (194, hk) § In 





16) 1G,1 k) 


+ LG, 8) ¥ st) |e (126) 


where the L‘”’(j, k) are functions of j and k only. It is clear that in the 
limit x — 0, 7 — © an expansion in terms of 


yz sn 


may be made where \/h is introduced as the finite expansion parameter. 
Replacing the sum in expression (125) by its approximating integral 


yields | 
(: ) (128a) 


n = wher”, (128b) 


h? 
“>a In 








lim [Z,(z) — L,(0)] = —29 


where 


A.7 Expansions of Lin Terms of Inverse Powers 

The expansion of LZ in terms of inverse powers of the argument is 
obtained by Taylor expansion of expression (98b) and integrating. This 
yields 


a sin? nB 


tate) = 40 [Ea aia Fe 


w/2 
=4 y cHor? os i sin’ nf sin”’ 6 dB, 





n21 (129a) 
where for example 
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By considering the Fourier decomposition of sin’ 6 it can be seen that 


w/2 ; 1 r/2 : 
i sin’ n8 sin” 6 dB = 5 [ sin” 6 dB, n>] 
ca) 0 





= eye n>j (130) 
(independent of n) so that 
Lo(x) = 0, 
iG) eak 3 a Fee + Ole-"P), n> 1 (18la) 
or identifying with expression (97) 
L,(e) = V(z) + Ofe™*?], an > 1. (131b) 


Evaluating expression (129a) directly in those cases in which expressions 
(129b) or (131a) cannot be used yields 


I(x) = 0, (132a) 
— aye Bt aye, LbT 5/2 -7/2 

L,(a) = 72 age -+ 64 2 + O(a "’") (132b) 

L(x) = re? — ico + = a? + Oe”), (132¢) 

L() = na? — Ba 4 “ x? + O(a), (132) 


A.8 The Gaussian Transformation 


In the neighborhood of x = 1, the convergence of the power series 
in zor xz is rather slow. Either the gaussian or Landen transformations 
may be used to transform the U, V, or L, functions into a region of rapid 
convergence.’° In the present case, the gaussian transformation is pre- 
ferred since it does not introduce incomplete elliptic integrals as does 
the Landen transformation. 

The result of the gaussian transformation is 


z, = 4071 + x) (1 + 2)? + a4)’, (133a) 
or inversely 


vy 
= da aya +a) +P abe 
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for the argument and 


V@) = 27 Viz), (134) 
TU() = Us) — F Ves), (135) 
TL,(x) = L,(a,) + i = Tar Va), (136) 
for the functions where 
T=(i+am)i=4+a0'?4a'. (137) 


APPENDIX B 
The Force and Stability Functions 


This appendix is a compilation of expressions for the force F and 
stability S, functions. Each of the functions is written in terms of the 
U and V or L, functions of Appendix A. Expressions in terms of the 
complete elliptic integrals of the first and second kind (denoted by K 
and EF respectively) permit the use of tables’* or numerical computation 
using the Landen transformation or the gaussian transformation.” 
The gaussian transformation is used in Section A.8. The power series 
expansions provided are necessary in obtaining numerical values of the 
functions for either very large or very small values of the argument and 
also provide the asymptotic forms of the functions. The argument of 
the functions is the domain diameter to thickness ratio, d/h = 2ro/h 


B.1 The Force Function 


The force function is written in terms of U by comparing the form 
of I’, expression (40), and the form of U, expression (84b), 


Ha) -1(0)[o(8) 2) a 


This expression is written in terms of the complete elliptic integral of 
the second kind using expression (84d) 


r(2) = = (2) (1 5. y(t 42/2) — i]. (138b) 


It is expanded about h/d = 0 using expression (94) 


d\_1))1, 38 (h\’ _ 38 (h\ 665 (h\’ 
: (2) =14[} T 39 (2) 64 (2) T 34576 (2) + | 
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1 (hy, 3 (h\ 25 (2)" | a 
+p-24 +24 -3E ae ate Koh 


(138¢) 
Additional terms may be generated using expressions (89) and (91). 
It is expanded about d/h = 0 using expression (96b) 


rf) £20) +40 SG) +a) + on 


Additional terms may be generated using expression (96a). 





B.2 The Radial Stability Function 


The radial stability function is written in terms of U and V using the 
definition of the radial stability function [equation (66a)], the expression 
for F [equation (138a)], and comparing the derivative of F' [equation 
(41)] with the form of V [equation (85b)], 


2 ~ 2G)Lo) - Ors) 2 

s,(2) a ee (¢) Nay Na) ap ee | 88) 
or in terms of Z, using the expression for L, [equation (100)] and the 
expressions for L,(0) [equations (115) and (116)] 


s(i)=-2 (9 [u(t)-n0] aw 


Expression (139a) is written in terms of the complete elliptic integrals 
of the first and second kind using expressions (84d) and (85d), 


s(@) = 2) [0.48mi yer 


— (2) + a K[(1 + h’/a’)*] — i]. (139¢) 


The expansion about h/d = 0 is obtained using expressions (1389b) and 


(103). 
h\’ 9 (h\* 5255 [h\® 
(2) -2( + 08 (1) a | 


s()-HL4+8 
“) 


32 
3 fh\’ 15 (h\* 175 [h\° 
+(1-22) +23) - 16 (4) +o |m 
(139d) 
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Additional terms may be generated using expressions (139a) and (89) 
through (92). The expansion about d/h = 0 is obtained using expressions 
(139b), (182b), and (108) to obtain L, (0) 


s(2) ~ : (2) 7 ; (“) T 7 (2) - os (¢) aye RE) 


Additional terms may be generated using expression (129b). 


B.3 The Elliptical Stability Function 


From the general definition of the S, of expression (66b) the elliptical 
stability function is 


s(@) - -4[9() +2) (u(8) -n0}]- oa 


Using expression (139b) for Sy , the Z, recursion relation (101) to reduce 
L, to L,, and V and (103) to obtain L,(0), S2 is written in terms of L, 
and V: 


s(t) = age (i) tt [1+ 8(G) Jee) + 8G) Ge)}- so 


The function ZL, is then eliminated using expression (100) to obtain the 
expression in terms of U and V: 


sft) = a (a) 42 [2 + 864) Joe) + [a GG) + 8G) |v Ge)} 


which then is written in terms of the complete elliptic integrals of the 
first and second kind using expression (84d) and (85d): 


s.(f) = a (42 - [2 + 10ff) | + 5) 2a + wer 
4: |10(#) ES 16(4 me 4 WK K( + h?/@)7]. (140d) 


The expansion about h/d = 0 is obtained using expressions (140b), (95), 
and (103): 


(4) -2\|-3- #0) +30) -O+-| 
+ [1+ 5) - 35,0) + 208 +] 


(140e) 
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Additional terms may be generated using expressions (140c) and (89) 
through (92). The expansion about d/h = 0 is obtained using expressions 
(140b), (97b), and (132b): 


a) Z 2 (a é aie), 3 (4) 

si(¢ ie de eae | ED 
Additional terms may be generated using expressions (140b), (97a), and 
(129b). 

B.4 The General Stability Functions 


Using the definition (66b) and the expression for S, [equation (139b)], 
the S,, are written in terms of the Z,, as 


s(f) «Leh [ull - nlf) - no +10) 


n= 2. (141a) 





The leading term of the expansion about h/d = 0 is obtained using ex- 
pressions (103), (105), (115), (116), (125), and (126): 


d\_1[, |4d]_ 4? -1 Si ju th] (ut 
s.(2) =— in h | 2(n? = 1) ra 23 _ i* 3@ oF 1) + 0, d ) 


n2=2.  (141b) 





The expansion about d/h = 0 is obtained using expressions (115), 
(116), (129b), and (181a): 


of) ait) Betts ec 
[earl (gy i Omusl2)}, n= 2. (1416) 


APPENDIX C 








Symbol List 


Numbers in parentheses are defining equations or figures. 


A exchange constant (71) 

a area 

d mean domain diameter, 27, (42) 

E(x) complete elliptic integral of the second kind (« = m = k’) 
Hy energy due to applied field (9) 

Ey, internal magnetostatic energy (10) 


3334 


Ey 
Ey 
F(a) 
H 

H 

A, 
Hy 
(H..)av 
h 
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total energy (7) 

total wall energy (8) 

generalized radial force function (33, 138, Fig. 3) 
magnetic field vector 

uniform applied field 

anisotropy field 

nucleation field (74) 

z-averaged z-component of magnetic field (45) 
plate thickness (Figs. 1, 4) 

complete elliptic integral of the first kind (x = m = k’) 
uniaxial anisotropy constant (71) 

integrally defined function (34, 98) 
characteristic length, o,,/47M? (67) 

wall width (81) 

magnetization vector 

saturation magnetization 

rotational periodicity (1) 

terms of order k 

quality factor, K,/27M? = H,/4rM, (72) 
cylindrical coordinate (Figs. 1, 2) 

plate radius (Fig. 3) 

nth radial Fourier amplitude (1) 

mean domain radius (1) 

nth infinite plate stability function (66, 189, 140, 141) 
distance between interacting magnetic charges (10b, 24, 28) 
integrally defined function (84) 

unit step function (5) 

volume 

integrally defined function (85) 

cylindrical coordinate (Figs. 1, 4, 5) 

operator (23) 

z— 2’ (21) 

variation in energy (11, 43) 

variation in 1, (8) 

variation in 6, (3) 

dirac delta function 

6’ — 6 (27, Fig. 2) 

polar azimuthal angle (Fig. 5) 

polar angle of wall normal (Fig. 5) 

cylindrical coordinate (Fig. 1) 

nth Fourier phase angle (1) 
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dn wavelength of nth variation (54) 
v azimuthal angle (Fig. 5) 
Veo azimuthal angle of wall normal (Fig. 5) 
E wall displacement vector (Fig. 5) 
p coordinate in displaced cylindrical coordinate system (85, Fig. 2) 
om wall energy density 
re) coordinate in displaced cylindrical coordinate system (35, Fig. 2) 
xX: transverse susceptibility 
Q magnetostatic potential 
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Physical and Transmission Characteristics 
of Customer Loop Plant 


By PHILIP A. GRESH 


(Manuscript received June 17, 1969) 


This report covers the principal physical and transmission character- 
istics of the Bell System customer loop plant. Items covered include a 
statistical characterization of physical composition, measured and cal- 
culated transmission characteristics, and measured noise and crosstalk 
performance. A survey conducted in 1964 provided the data base for this 
report and comparisons of data obtained from a similar survey in 1960 
illustrate that, in many respects the composition of loop plant changes only 
slowly with time. Consequently, the 1964 survey results are believed to be 
representative of today’s plant. 

The types of analyses presented in this paper are of increasing interest 
to certain Bell System customers because of the increasing number and types 
of services provided over local telephone facilities. 


I. INTRODUCTION 


This report covers the principal results of the 1964 Bell System cus- 
tomer loop survey. This survey provides a statistical characterization 
of physical composition, measured and calculated transmission char- 
acteristics, and measured noise and crosstalk performance of customer 
loop plant. Comparisons of data obtained from the 1964 survey and 
a similar survey made in 1960 are also presented. 

Several of the principal transmission characteristics of Bell System 
customer loop plant as defined by the 1960 loop survey were published 
in 1962 by R. G. Hinderliter.* Additional published data on the trans- 
mission characteristics of Bell System toll connections is available in a 
BSTJ article by I. NAsell.” 

The 1964 Bell System survey was comprised of two separate surveys 
which were merged for analysis and presentation purposes. The basic 
survey was the general loop survey which consisted of a simple random 
sample of 1,100 main stations selected from the population of all main 
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stations (45, 300, 000) as of January 1, 1964. However, since only 3.25 
percent of all main stations are served by loops longer than 30 kilofeet, 
only 35 samples would have been obtained to define the characteristics 
of the longer loops. Consequently, a long loop survey consisting of a ran- 
dom sample of 955 main stations served by loops longer than 30 kilofeet 
was obtained. The data obtained from the long loop survey has been 
used in those instances where characteristics are being expressed as a 
function of length to permit better resolution of the characteristics for 
the longer loops. In both of these sub-surveys, official telephones, foreign 
exchange lines, dial teletypewriter exchange (TWX) lines and special 
service lines were omitted as it was felt that their design would not be 
representative of customer loop plant. 


II. SUMMARY OF RESULTS 


Analyses of data obtained in the 1964 loop surveys lead to six general 
results. 


(t) The average customer loop length is 10.6 kilofeet with only 
10 percent of the main stations located beyond 21 kilofeet from their 
serving office. The length distributions show a slight trend toward longer 
loops between 1960 and 1964, with the average loop length increasing 
by 300 feet. 

(11) The average 1 kHz insertion loss of Bell System loop plant is 
3.8 dB and 95 percent of all main stations are served by loops having a 
1 kHz loss of less than 8 dB. At 3 kHz, the average loss is 7.8 dB and 
95 percent of the main stations have less than 17 dB insertion loss. 

(it) The average noise balance of party-line loops is 56 dB, while 
the balance for individual line loops is 69 dB. Only 5 percent of the 
individual line loops have a noise balance of less than 50 dB while nearly 
20 percent of party-line customers are served by loops with less than 
50 dB of balance. The substantially lower balance for party lines is 
largely due to the inherent circuit unbalancing effect caused by the use 
of grounded ringers for party-line service. 

(wv) The average metallic circuit noise (C-message weighted) at a 
customer’s station set is approximately 5.5 dBrne including the noise 
contribution of the central office wiring as well the noise contribution 
of the outside plant facilities. Only 8 percent of the individual lines 
have noise in excess of the Bell System objective of 20 dBrne. However, 
18 percent of the party-line customers have circuits which have noise in 
excess of 20 dBrnc because of the generally poorer circuit balance of 
party-line circuits. 
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(v) Comparison of measured and calculated transmission char- 
acteristics of Bell System loop plant has demonstrated that the outside 
plant cable records are sufficiently accurate’ to permit characterizing 
the loop plant transmission performance by theoretical calculations 
based on the physical composition of the loops as described in the outside 
plant records. 

(vt) Main stations served by loops in excess of 30 kilofeet in length 
were found to be exponentially distributed as a function of working 
length, with the population of main stations reduced by 50 percent with 
every 11-kilofoot increase in loop length (see Fig. 6). It is estimated 
that 1.5 million Bell System customers (3.25 percent of all customers) are 
presently served by loops in excess of 30 kilofeet in length. Due to the 
party-line character of longer loops, the 3.25 percent of all Bell System 
main stations included in the long loop segment of plant used only 
1.7 percent of the working Bell System exchange lines. 


III. DESIGN OF THE SURVEY 


The first steps in the survey were to define the population to be 
sampled and to obtain a complete list of the sampling units. In these 
two surveys (that is general loop and long loop), main stations were 
selected as the sampling units and all Bell System main stations as of 
January 1, 1964, were taken as the population to be sampled. A simple 
random sample was chosen as the sampling plan. 

The sample size of the general loop survey was selected to provide 
data of equal precision to that obtained in the 1960 survey. The design 
parameter chosen was the average distance to the sampled main sta- 
tions, and the precision was measured in terms of the width of the con- 
fidence interval bounding this average distance. The desired confidence 
interval (at 90 percent confidence level) of +450 feet on the average 
cable distance to the sampled main stations dictated a sample size of 
1,100 randomly selected main stations. The actual confidence interval 
obtained was +476 feet. 

In the long loop survey, lack of previous knowledge concerning the 
composition of long loops made it difficult to accurately determine 
the minimum sample size which would provide sufficient precision. The 
design parameter selected for the long loop survey was the average noise 
metallic (C-message weighting) measured at the telephone sets of the 
sampled main stations to a dialed-up termination. The precision aimed 
for was a +1.0 dB confidence interval (at 90 percent confidence level). 
A sample size of 955 main stations was collected, and the confidence 
interval obtained was +0.73 dB. 
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The two surveys had satisfactorily wide geographical dispersion, with 
every associated company (except Canada) contributing to the survey. 
Reference to Fig. 1 will illustrate that the large companies and the 
metropolitan areas contributed heavily to the general loop survey and 
the rural areas contributed heavily to the long loop survey. 


IV. LOOP SURVEY RESULTS—PHYSICAL COMPOSITION 


Data obtained in the loop survey included detailed loop schematics 
indicating the loop composition of each of the loops sampled in the 
survey. All distributions of physical quantities discussed herein were 
derived by analysis of these loop schematics. Since similar data were 
obtained in the 1960 loop survey, comparison of the physical distribu- 
tions obtained in the two surveys has also been made. 

Table I gives a summary of the statistics for the principal physical 
properties of loop plant. Data are included for both the 1960 and 1964 
surveys and significance levels for differences of mean values are pre- 
sented when meaningful. Cumulative distributions of these factors are 
shown in Figs. 2 through 5. The distribution of working bridged tap 
is not given since 82 percent of the sampled main stations were served 
by loops having zero working bridged tap and consequently the dis- 
tribution is not particularly enlightening. 

As indicated in Table I, the estimated average route distance from 
serving central office to main station in the Bell System is 10.6 kilofeet 
with 90 percent confidence that the true mean value lies within +476 
feet of this estimate. Note that although the estimated mean working 
length in 1964 is over 300 feet longer than that estimated in 1960, it is 
not statistically possible to claim that the observed increase is indeed 


TABLE I — 1964 Customer Loorp SuRvVEY SUMMARY 
oF MaIn STATION CHARACTERISTICS 


90% Confidence 
Mean (ft) Limits on Mean (+ ft) Sign. Level for 


Main Station | ————__-—-—__| Difference of Means 
Quantity 1960 1964 1960 1964 in Percent 

Working length 10,288 | 10,613 450 476 * 
Total bridged tap 2,619 | 2,478 169 172 7 
Working bridged tap 381 228 107 74 95 
Airline distance 7,604 | 7,758 353 386 = 
Working length/ 

airline distance 1.45 1.50 0.02 0.03 98 


Drop wire excluded except when individual lengths exceed 400 feet. 
* Levels of significance less than 80 percent indicated by asterisk. 
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an increase. Reference to the cumulative distribution of working length 
depicted in Fig. 3 will, however, show that shifts in the distribution have 
occurred since 1960. Note that the percentage of longer loops increased 
from 1960 to 1964. 

Analysis of the long loop survey data has shown that the Bell System 
main stations served by loops in excess of 30 kilofeet are exponentially 
distributed as a function of working length as depicted in Fig. 6, with 
the main station population diminishing by 50 percent with each 11 
kilofeet increase in loop length. Survey analysis indicates that about 
1.5 million or 3.25 percent (with 90 percent confidence interval of 
+0.2 percent) of all Bell System main stations were located 30 kilofeet 
or more from their serving central offices as of the end of 1964. Due to 
the party-line character of the longer loops, the 3.25 percent of all Bell 
System main stations included in the long loop segment of plant used 
only 1.7 percent of the 39,300,000 Bell System lines working in 1964. 

Analysis of the survey data has also provided valuable insight into 
the type-of-service distribution of Bell System customers and the physi- 
cal composition of the plant provided to meet this distribution as shown 
in Figs. 7 to 10. The type-of-service distribution was derived as a 
function of length to the sampled main station and took advantage of 
the pooling of data from the two surveys. To evaluate the physical 
composition (type of facility, gauge, and pair size) of the loop plant as 
a function of distance, the sampled loops from the general loop survey 
were inspected at intervals of 1,000 feet starting at the central office. 
Both the general loop survey and the long loop survey were similarly 
inspected to define these distributions beyond 30 kilofeet. 

The extent of party-line development as a function of loop length 
is shown in Fig. 7. Note the rapid increase in eight-party development 
for loop lengths greater than 30 kilofeet. Examination of the pair size 
distribution as a function of distance from the central office (Fig. 8) 
shows rapidly decreasing pair size with distance (at the 50-kilofoot point 
50 percent of the sampled loops are contained in cables with fewer than 
16 pairs). Similarly, the distribution of gauge shown in Fig. 9 illustrates 
a rapid transition to coarse gauge with increasing distance from the cen- 
tral office. For example, at 30 kilofeet 60 percent of the sampled loops 
are composed of gauges coarser than 22 gauge. Note also (Fig. 10) that 
the longer loops are primarily developed with aerial facilities. For 
example, 78 percent of all plant is aerial at the 30-kilofoot point from the 
central office. For the longer loops where small pair sizes are used, the 
pole line costs become a significant portion of the total loop costs and 
this factor is one of the reasons for the present trend towards the use 
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of buried plant. Since the sampled loops were randomly selected from 
all existing plant, buried plant is not as prominent as it would be in a 
sample of new construction. Note, however, that beyond 30 kilofeet, 
buried facilities in 1964 accounted for approximately 20 percent of the 
loops. 


v. 1964 Loop SURVEY RESULTS—TRANSMISSION PERFORMANCE 


Data obtained in the 1964 loop survey have provided considerably 
more comprehensive knowledge of the transmission performance of 
customer loop plant than heretofore available. In the 1960 loop survey 
all transmission performance data were developed by deriving equivalent 
“T”? networks from the information supplied on the loop sketches and 
analyzing these networks for transmission performance at nine fre- 
quencies in the voice band. Similar analysis has been performed for each 
of the sampled loops in the 1964 loop surveys, and in addition trans- 
mission measurements were made. The measurements covered noise, 
crosstalk, insertion loss at 1, 2 and 3 kHz, and de resistance. The com- 
bination of these two sets of transmission performance data (one cal- 
culated and one measured) permits three types of analysis: 


(z) changes in transmission performance since 1960 by comparison 
of calculated 1960 data with calculated 1964 data, 
(77) comparison of measured versus calculated data for the 1964 
survey, and 
(777) provision of heretofore unavailable data on the noise and cross- 
talk performance of customer loop plant. 


Since measured insertion loss data was not obtained in the 1960 
survey, comparison of 1960 and 1964 data must be based on calculated 
values. Figure 11 depicts the 1 kHz calculated distributions for both 
surveys. It can be seen that insertion loss performance has remained 
virtually unchanged since 1960. 

For those transmission characteristics where measured data are 
available in addition to the analytically derived data, minor differences 
in performance are exhibited by the two distributions of data (Fig. 12). 
In this regard it is important to realize that the measured data should 
provide a more accurate estimate of performance. There are several 
reasons for greater confidence in the measured data. First, possible in- 
accuracies in cable records or errors in transferring data from the records 
to the loop sketches can introduce errors in the calculated data. Second, 
errors in construction, such as omission or improper connection of load- 
ing coils, cannot be detected from the records. Third, use of calculated 
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data assumes that all cables exhibit nominal characteristics and con- 
sequently do not reflect manufacturing tolerances and environmental 
factors. 

The cumulative distributions of insertion loss at 1, 2, and 3 kHz for 
customer loop plant as derived from both measured and calculated 
data are presented in Fig. 12. These insertion loss measurements and 
calculations were made with a 900 ohm source and load as depicted in 
Fig. 13. Measured loss was found to be consistently higher than calcu- 
lated loss across the entire voice frequency band. The absolute dif- 
ferences between measured and calculated losses are small however, as 
indicated by the differences in mean losses. For example, at 1 kHz the 
measured loss was 3.8 dB and the calculated loss was 3.5 dB. This com- 
parison of measured and calculated insertion losses demonstrates the 
feasibility of characterizing the loop plant transmission performance by 
theoretical calculations based on the physical composition of the loops 
as described by outside plant records. Still referring to Fig. 12, note 
that approximately 95 percent of all Bell System main stations are 
served by loops having a 1 kHz insertion loss of less than 8 dB with a 
mean loss of 3.8 dB. Similarly, at 3 kHz the 95 percent point is 16 dB 
and the mean loss is 7.8 dB. A scatter diagram of the 1 kHz measured 
insertion loss as a function of loop length is shown in Vig. 14. This dia- 
gram was obtained by merging the data from both the general loop 
survey and the long loop survey and indicates that the high loss loops are 
not limited to the long loop category. The high losses observed on some 
of the short loops generally reflect excessive bridged tap. 

An insertion loss measurement of particular interest to designers 
of data equipment is the slope of loss versus frequency from 1000 to 
2750 Hz. Cumulative distributions of the 2750 — 1000 Hz insertion 
loss (insertion loss measured with 900 ohm source and load) have been 
provided for all Bell System loop plant and for those loops serving 
business customers in Figs. 15 and 16 respectively. 

Another important transmission characteristic is return loss, signifi- 
cant from echo and singing considerations. Return loss performance was 
not available from the measured data; consequently, it was cal- 
culated. Table II provides 1964 loop survey return loss results for nine 
frequencies and Fig. 17 presents the cumulative distributions and 
histograms for the 3 kHz singing return loss and echo return loss (equal 
weighting of the 500 to 2,500 Hz band). These data are all developed 
on the basis of looking into the customer loop at the central office end 
of the loop. The return loss is obtained by matching against a 900 ohm, 
2.16 wF termination at the central office, with the customer end of the 
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TABLE II — CaLcuLATED ReEetTuRN Loss From OFFICE 
TowARD STATION* 


1964 


90% Confidence 

Frequency (Hz) Mean (dB) Interval (= dB) 
200 8.0 0.11 
300 10.2 0.12 
500 13.4 0.17 
1,000 15.4 0.30 
1,500 13.1 0.27 
2,000 10.9 0.25 
2,500 9.1 0.20 
3,000 7.7 0.16 
3,200 7.1 0.15 
Echo 11.2 0.15 


* Station end of loop terminated in impedance of ‘‘off-hook’’ station set. 


loop terminated in the impedance of the ‘‘off-hook” station set. Similar 
data on return loss are presented later from the station end of the loop. 

The comparison of measured versus calculated loop resistance shown 
in Fig. 18 indicates that for general loop plant there is no significant 
difference between measured and calculated data but calculated loop 
resistance is slightly higher than measured resistance (574 ohms cal- 
culated, 567 ohms measured). Measured values may have been in- 
fluenced by the fact that measurements were made during the winter. 
Calculated resistances were based on an average temperature of 68°F. 

Theoretical calculations cannot be made of all transmission perform- 
ance characteristics. Two such examples are noise and crosstalk. Since 
these characteristics are dependent upon external influences (induction 
from adjacent power lines, cable pair balance and the particular pair 
assignment), field measurements were made on each of the sampled 
loops using a Western Electric Company model 3A noise measuring set. 
The noise and crosstalk measurements were made as depicted in Figs. 
19 and 20. 

It is convenient to analyze loop noise in terms of the two factors 
which contribute to the resultant interference. The first of these is the 
magnitude of open circuit longitudinal voltage induced from power 
lines and the second is the circuit balance of the cable pairs and central 
office equipment. The cumulative distribution of the open circuit longi- 
tudinal voltage for general loop plant is shown in Fig. 21 for 3 kHz flat 
weighting. This voltage is induced in a longitudinal mode, and conse- 
quently only that portion of it which is converted to the metallic circuit 
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will create an interference problem. The circuit balance reflects the 
extent to which the longitudinal voltage is converted to metallic voltage 
and is, therefore, a measure of the susceptibility of the telephone plant 
to inductive interference such as power-line hum.* 

As seen in Fig. 22 party lines are much more susceptible to power- 
line hum than individual lines because of the unbalance introduced by 
the grounded ringers associated with party-line station sets. On the aver- 
age, individual lines have approximately 12 dB better balance than party 
lines. Part of this is a result of the shorter length distribution of in- 
dividual lines which offers less opportunity for cable pair unbalances 
to accumulate; but it is reasonable to expect a balance improvement of 
10 dB with ringers isolated from ground. 

The combination of the induced longitudinal voltage and the circuit 
unbalance produces the metallic noise distribution at customers’ station 
sets (in off-hook state) as shown in Figs. 23 and 24. For comparison 
purposes, the C-message weighted noise to ground (longitudinal noise) 
is also shown on these figures. Figure 23 depicts the noise contribution 
of the loop plant only, while Fig. 24 includes the noise contribution of 
the central office wiring. In both cases the station end of the loop was 
terminated in an off-hook 500-type station set with the transmitter 
and receiver replaced by equivalent resistors. The metallic noise has been 
measured with C-message weighting to reflect the relative interfering 
effects of the noise on voice transmission. The important limits to con- 
sider are the Bell System long-term noise performance objective of 20 
dBrne and the immediate remedial action limit of 80 dBrnc. As seen 
in Fig. 24, only 8 percent of the individual lines had total metallic 
noise in excess of 20 dBrnc. However, 18 percent of the party-line cus- 
tomers have noise in excess of 20 dBrne. 

The near-end crosstalk coupling loss characteristics of customer loop 
plant as derived from measured data from the general loop survey are 
shown on Fig. 25. Along with the overall distribution of crosstalk 
coupling loss is shown the distribution for nonloaded loops only (84 
percent of all sampled main stations). The nonloaded loop distribution 


* Loop circuit noise balance is defined here as 


open circuit longitudinal voltage 


20 logte metallic voltage 


where both voltages are measured with C-message weighting. The validity of this 
definition depends on the assumption that the longitudinal voltage induced from 
adjacent power lines is the only source of metallic noise. This is generally not true 
when the noise to ground measures less than 20 dBrne, and consequently loop 
balance for such loops cannot be computed from the measurements. 
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can be approximated by a normal distribution with a mean crosstalk 
coupling loss of 115 dB and a standard deviation of 12 dB. A comparison 
of these two curves indicates that the poorer crosstalk performance of 
longer loaded loops dominates the low loss tail of the general loop survey 
distribution. 

The final transmission characteristic to be discussed is the loop input 
impedance as calculated both at the central office and at the station set. 
Figure 26 presents the plot of loop input impedance as seen at the 
central office as a function of frequency (not including central office 
wiring or equipment). For these calculations the station end of the 
loop was terminated in an off-hook 500-type station subset with the 
transmitter and receiver replaced by equivalent resistors. Curves have 
been provided separately for loaded and nonloaded loops because of 
the large difference in their characteristic impedance. Also shown is the 
characteristic impedance of the central office matching network as a 
function of frequency. The function of this network is to provide high 
return loss performance across the voice frequency band by matching 
as close as possible the impedance of the various loops. It is apparent 
that both the nonloaded loops and loaded loops should have their 
highest return losses around 1 kilohertz and that the loaded loops 
should perform more poorly than nonloaded loops at the lower fre- 
quencies. 

Plots of mean input impedances, such as in Fig. 26, are useful for 
indicating the general input impedance behavior as a function of fre- 
quency. Variations that occur at each frequency, and their effects on 
return loss, are best shown as scatter diagrams. Figures 27 and 28 
present the loop input impedance at 1 kHz for nonloaded and loaded 
loops. Superimposed on all scatter diagrams are return loss circles 
referenced to the 900 ohm and 2.16 uF matching network. Any loop 
having an impedance lying within a particular circle will have a return 
loss, when measured against the specified matching network impedance, 
which exceeds the given return loss value. Visual examination of the 
scatter pattern as it relates to the return loss circles provides a ready 
means of evaluating the return loss performance of various segments of 
the loop plant (assuming, of course, that the input impedances of loops 
in that segment are known). 

The range and shape of the input impedance scatter pattern at each 
frequency are of interest because they point up the difficulty of designing 
a simple matching network which, at even a single frequency, will 
provide very high return losses for nearly all loops. Considering the 
characteristics of the nonloaded loops shown in Figure 27 it is evident 
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that many of the loops tend to follow a smooth curve, while the others 
are scattered about this curve. The smooth curve results from the varia- 
tion in loop length, while the scatter is due to the effects of bridged tap, 
overgauging, and variations in types of subsets. 

Perhaps of particular interest to Bell System customers are the input - 
impedance characteristics of Bell System loop plant as seen from the 
station end of the customer loop. The input impedance of a customer 
loop as measured at the station set can vary considerably based on the 
type of facility connected to the loop at the central office. Various cir- 
cuit connections may involve use of four-wire trunks, two-wire trunks 
or intraoffice circuits. In the following analysis a 900 ohm and 2.16 uF 
central office termination has been used to represent a four-wire trunk 
termination, and the midsection input impedance of 22 gauge H88 
loaded cable has been used to represent a two-wire trunk. For the simula- 
tion of intraoffice calls, a Monte Carlo technique was used to select a 
random sample of 500 pairs of loops from the 1,100 loops in the general 
loop survey. A loop was randomly selected as the sample loop, and then 
the input impedance (from the central office) of another randomly 
selected loop was chosen for the central office termination. 

The 1,100 loops of the 1964 general loop survey were segregated into 
two groups (loaded and nonloaded loops) for all but the simulated 
intraoffice calls because of the great differences in impedance range of 
the two populations. Presentation of scatter diagrams of input im- 
pedance from the station set has been limited to 1 and 3 kHz. These 
return loss circles were generated assuming the use of a 500-type station 
set and it was further assumed that the 500 set was operating on a cur- 
rent equal to the average loop current of 45.5 mA. 

Figures 29 through 32 are the input impedance scatter diagrams for 
loops with a simulated two-wire trunk (22 gauge H88 loaded cable) 
termination at the central office. The scatter is primarily a result of 
overgauging, open wire, and bridged tap or varied end section length. 
Smoothed curves of the mean input impedances of loaded and non- 
loaded loops with a 22 gauge H88 cable termination are presented in 
Fig. 33. Scatter diagrams for the loops with a central office termination 
of 900 ohms and 2.16 uF (simulated four-wire trunk) are presented in 
Figs. 34 through 37. The general input impedance behavior of these 
loops as a function of frequency is indicated in Fig. 38 by the plot of the 
mean input impedances at nine voiceband frequencies. 

The scatter diagrams and the mean input impedance curve for the 
simulated intraoffice calls are shown in Figs. 39 and 40. The effect of 
connecting together two loops, one of which is terminated by a station 
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set, and the other whose input impedance is calculated at its station 
set, is shown by the mean input impedance curve for simulated intra- 
office connections in Fig. 41. This curve has a shape characteristic of 
longer nonloaded loops. The mean input impedance curves for non- 
loaded loops with simulated two- and four-wire trunk terminations at 
the central office are also shown in Fig. 41. The major differences in the 
characteristics of these curves are at the low frequencies where the shunt 
capacitance of the cable masks the termination less than it does at high 
frequencies. 
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Fig. 1— Geographic distribution of sampled loops for the 1964 customer loop 
survey. Eleven hundred loops were sampled for the general loop survey and 955 
loops were sampled for the long loop survey. 
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Fig. 2—- Working length to the main station. 


1960 1964 


Mean (feet) 10,288 10,613 
90 percent confidence 
limits on mean (feet) +450 +476 
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Fig. 3— Total bridged tap. 
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Fig. 4— Airline distance to main station. 
1960 1964 


Mean (feet) 7,604 7,758 
90 percent confidence 
limits on mean (feet) +353 +386 
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Fig. 5 — Ratio of working length-airline distance to main station. 


1960 1964 
Mean 1.45 1.50 
90 percent confidence 
limits on mean +0.02 +0.03 
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Fig. 6— Distribution of long loops (1964 long loop survey). The mean was 
45,938 feet; 90 percent confidence limits on the mean was +870 feet. 
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Fig. 7— Type of service distribution versus loop length (1964 combined loop 
surveys). One-, two-, four-, and eight-party plots include residence service only; 
business includes PBX, centrex, and coin. 
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Fig. 8— Pair size distribution (1964 combined loop surveys—general loop and 
long loop surveys). 
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Fig. 9.— Gauge distribution (1964 combined loop surveys—general loop and 
long loop surveys). 
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Fig. 10— Type of construction distribution (1964 combined loop surveys— 
general loop and long loop surveys). 
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Fig. 11— Distribution of calculated insertion losses at 1, 2, and 3 kHz 





(--- 1960; 1964). 
1 kHz 2 kHz 3 kHz 
1960 1964 1960 1964 1960 1964 
Mean (dB) 3.4 3.5 5.4 5.3 7.4 7.3 


90 percent confidence 
limit on mean (dB) +0.11 +0.10 +0.18 =+0.16 +0.24 +0.21 
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Fig. 12— Distribution of measured and calculated insertion losses at 1, 2, and 


3 kHz (—-—-—- calculated ; ————— measured). 
1 kHz 2 kHz 3 kHz 
Meas- Calcu- Meas- Calcu- Meas- Calcu- 
ured lated ured lated ured lated 
Mean (dB) 3.8 3.5 5.6 5.3 7.8 7.3 


90 percent confidence 
limit on mean (dB) +0.12 +0.10 +0.17 +0.16 +0.22 +0.21 
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Fig. 13— Insertion loss measurement technique. The oscillator is set for 600 
ohm output termination; the 3 A noise measurement set is equipped with 900 
ohm termination. 
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Fig. 14— Measured 1 kHz insertion loss scatter diagram (1964 combined loop 
surveys—general loop and long loop surveys). 
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Fig. 15 — Distribution of insertion loss slope between 2750 and 1000 Hz for 
residential plus business loops. 
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Fig. 16— Distribution of insertion loss slope between 2750 and 1000 Hz for 
business loops only. 
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Fig. 17— Distribution of 3 kHz and echo return losses at central office. Echo 
return loss distribution assumes flat weighting of 500-2500 Hz band. 








Echo 3 kHz 
Mean (dB) 7.7 11.2 
90 percent confidence 
limits on mean (dB) +0.16 +0.15 
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Fig. 18 — Measured and calculated distribution of resistance to main station. 


Measured Calculated 


Mean (ohms) 567 574 
90 percent confidence 
limits on mean +15.8 +17.0 
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Fig. 19— Noise measurement technique for (a) noise metallic—loop only and 
(b) noise longitudinal—loop only. 
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Fig. 20— Crosstalk measurement technique. Disturbing loop—randomly se- 
lected pair in the same 100-pair group as the sample loop on which the measure- 


ments are being made. The oscillator on disturbing loop is set for 600-ohm 
output termination. 
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Fig. 21— Noise to ground at main station for 3 kHz flat weighting. The mean 
was 49.2 dBrn; the 90 percent confidence limits on the mean was +0.56 dBrn. 
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Fig. 22— Loop circuit noise balance. These distributions are based on the 476 
loops where measurements permitted an accurate estimate of loop balance to be 
made. These are for loops having noise to ground greater than 20 dBrnc. 


Party Individual 


Mean (dB) 56.1 68.7 
90 percent confidence 
limits on mean (dB) 1.55 +0.87 
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Fig. 23— Noise metallic at main station with C-message weighting for loop 


plant only. 


Mean (dBrnc) 
90 percent confidence 
limits on mean (Bdrnc) 


Noise to 
Ground 
19.1 





+0.5 


Noise Metallic 
All Individual Party 


—1.1 —3.1 - 6.7 
+0.5 +0.5 +1.3 


CUSTOMER LOOP PLANT 3371 


100 








80 

z 

2 

E 

b 

> 60 

< NOISE TO GROUND 
= ~~ (LONGITUDINAL ) 
w 

| 

- 40 

Zz. 

ud 

19) 

ao 

Ww 

oO 








-12 -6 ro) 6 12 18 24 30 36 42 48 
NOISE METALLIC IN dBrnc 


Fig. 24— Noise metallic at main station with C-message weighting for loop 
Sine ‘plus central office. 
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Fig. 25—1 kHz near end crosstalk coupling loss at central office. Central of- 
fice terminated in 900 ohms; customer end terminated in receiver off hook sta- 
tion set. 
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Fig. 26— Loop impedance at central office. 
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Fig. 27 — Nonloaded loop input impedances at 1 kHz measured from the cen- 
tral office. Return loss circles based on 900 ohm + 2 ywF network. 
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Fig. 28— Loaded loop input impedances at 1 kHz as measured from the 
central office. Return loss circles based on 900 ohms ++ 2 uF network. 
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Fig. 29— Nonloaded loop input impedances at 1 kHz measured from station 
sct with a simulated two-wire trunk termination at the central office. Return loss 
circles based on 500-type subset impedance. 
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Fig. 30 — Nonloaded loop input impedances at 3 kHz measured from station 
set with a simulated two-wire trunk termination at the central office. Return 
loss circles based on 500-type subset impedance. 
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Fig. 31— Loaded loop input impedances at 1 kHz measured from station set 
with a simulated two-wire trunk termination at the central office. Return loss 
circles based on 500-type subset impedance. 
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Fig. 32— Loaded loop input impedance at 3 kHz measured from station set 


with a simulated two-wire trunk termination at the central office. Return loss 
circles based on 500-type subset impedance. 
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Tig. 33 — Mean value of loop input impedance from station set with simu- 
lated two-wire trunk termination at the central office. 
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Fig, 34 — Nonloaded loop input impedances at 1 kHz measured from station 
set with simulated four-wire trunk termination at the central office. Return loss 
circles based on 500-type subset impedance. 
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Fig. 35— Nonloaded loop input impedance at 3 kHz measured from station 
set with simulated four-wire trunk termination at the central office. Return loss 
circles based on 500-type subset impedance. 
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Fig. 36 — Loaded loop input impedances at 1 kHz measured from station set 
with simulated four-wire trunk termination at the central office. Return loss 
circles based on 500-type subset impedance. 


3382 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1969 
































REACTANCE IN OHMS 





Ens S ae 
Saree 

aS a ae 
Seat hett 

| | I 

Leak eed 


ll 
PSP R= id 
COPA RSE 


ie) 100 200 300 400 500 600 700 800 900 1000 1100 {i200 1300 
RESISTANCE IN OHMS 


for ae an 
aise see 


Fig. 37 — Loaded loop input impedances at 3 kHz measured from station set 
with simulated four-wire trunk termination at the central office. Return loss cir- 
cles based on 500-type subset impedance. 
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Fig. 38 — Mean value of loop input impedance from station set with simulated 
four-wire trunk termination at the central office. 
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Fig. 39 — Input impedance of all loops at 1 kHz measured from station set 
with a simulated intraoffice circuit termination at the central office. Return loss 
circles based on 500-type subset impedance. 
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Fig. 40 — Input impedance of all loops at 3 kHz measured from station set 
with a simulated intraoffice circuit termination at the central office. Return loss 
circles based on 500-type subset impedance. 
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Fig. 41— Mean input impedance of nonloaded loops measured from station 
set with: Curve A—simulated four-wire trunk 900 ohms + 2 uF central office 
termination; Curve B—simulated two-wire trunk 22-gauge H-88 cable central 
office termination; and Curve C—simulated intraoffice cally condition. 


Radio-Relay Antenna Pointing for 
Controlled Interference with 
Geostationary Satellites 


By C. W. LUNDGREN and A. 8. MAY 
(Manuscript received July 23, 1969) 


We present analytical methods (i) for calculating microwave radio re- 
fraction for negative and positive initial ray angles accounting for station 
height and (ii) for determining refraction-corrected ranges of antenna point- 
ing azimuth within which mutual interference with geostationary satellites 
in shared frequency bands is likely. 


I, INTRODUCTION 


When radio-relay and communication satellite systems share fre- 
quency bands, as they do at 4 and 6 GHz, it is necessary to impose 
restrictions on both systems so that interference is not excessive. The 
CCIR (International Radio Consultative Committee) recommends 
that radio-relay antennas maintain a specified angular separation with 
respect to the geostationary (stationary equatorial) orbit or, where 
this is not practicable, the application of power limitations to terrestrial 
radio transmitters involving reception at the satellite. While the above 
restrictions protect satellites, designers of terrestrial systems should 
be aware of possible interference into radio-relay systems from satellite 
radiation arriving at low elevation angles and close to the on-beam 
directions of receiving antennas. Because the dielectric constant of the 
earth’s atmosphere varies with altitude, the radio-relay beam is not 
straight, and atmospheric refraction must be considered when computing 
the directions of radio beams for which the restrictions apply. 


1.1 Simplified Exposure Model 


Figure 1 introduces the geometry of the problem and illustrates 
significant trends and limits. Radio-relay site P located at North Lati- 
tude ¢ degrees is shown as viewed from above the earth. An are of the 
geostationary satellite orbit is also shown. The orbit longitude of point 
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S, and the earth longitude of site P coincide numerically (that is, 8, 
is in the direction of true south, PS/ , observed from P). 

Two radio rays from P intercept the orbit symmetrically at points 8. 
The controlling geometric relationships are based upon triangles formed 
by points S, P, and geocenter O. Angle O-P-S is determined by the 
elevation angle of the radio-relay antenna including ray bending due 
to atmospheric refraction. A special case is depicted in Fig. 1, wherein 
unbent intercepting rays PS and also ray PS/ are assumed to lie in the 
local horizontal plane at P and remain tangent to the earth sphere at P. 
Thus, angles S-P-S/ are always antenna pointing angles to orbit inter- 
ception referred to “true south.” 

It is instructive to visualize the relationship between latitude ¢ at P 
and the location of orbit intercepts 8, for a given fixed triangle OPS. 
As points § approach 8, , the constraints imposed above require that 
the station latitude » approach a maximum latitude ‘‘visible” to the 
orbit. The resulting single pointing direction to orbit intercept is due 
south (from P to S,). Conversely, the maximum separation between 
points § and S, obtains when ¢ is zero (for site P located on the equator). 
Since both intercepting rays are tangent to the equator at P, the limiting 
pointing directions are due west and due east. 

Note that a rotation in azimuth of the antenna (about the local 
vertical axis at P) between known orbit-intercept directions results in 
radio rays PS which fall below the orbit as viewed from P; rotations 
beyond these ‘‘critical azimuths” result in rays above the orbit. 


1.2 Computations 


Given the latitude and elevation angle of a microwave radio-relay 
antenna, one can calculate the pointing azimuth for which, neglecting 
refraction, the main beam axis intercepts the geostationary orbit. This 
calculation is repeated to produce screening charts like Fig. 2. Such 
charts are adequate for quickly determining a hazard condition, but 
often the true critical azimuths must be approached closely while main- 
taining tolerable interference levels. 

A graphical procedure proposed for the convenience of system plan- 
ners provides pointing angle estimates for most stations when caution 
is exercised in those steps accounting for atmospheric refraction.’ An 
analytical technique adaptable to machine calculation is also required 
for rapid, accurate screening of large numbers of existing and proposed 
radio-relay sites for potential interference exposures. Precise evaluations 
are required for cases of unavoidable exposure. 

The method described in following sections can be used by the system 
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Fig. 2— Screening chart, neglecting refraction. 


planner to determine critical azimuths and the ranges of azimuths to be 
avoided (accounting for atmospheric refraction) for prescribed minimum 
angular separations between main beam axes and the geostationary 
orbit. 

The Central Radio Propagation Laboratory (CRPL) Exponential 
Reference Atmosphere is adopted for the generation of microwave 
radio refraction curves by accounting for station heights and negative 
antenna elevation angles, for several representative refractive indexes.” 
Earlier extrapolations are based upon upper limits of the total bending 
associated with assumed earth-grazing rays.°"* 

A refraction anomaly arising from a temperature inversion, storm, 
ducting, or other departure from an assumed representative radial ray- 
bending model precludes an absolutely confident evaluation of any given 
exposure at a given time. These phenomena are usually localized. How- 
ever, the intent of these computations is to protect the geostationary 
orbit against continuous interference arising simultaneously from a large 
number of terrestrial systems. 

Following a development of the refraction model, we derive the critical 
pointing azimuth corresponding to orbit intercept. The geocentered 
longitude displacement between the radio-relay site and the point where 
the refracted beam intercepts the orbit is next determined by spherical 
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geometry. Then the apparent slope* of the geometric orbit trace, as if 
viewed unbent by refraction from the station, is obtained corrected for 
aspect (also dependent upon antenna elevation angle and concomitant 
refraction). Using the refraction data converted to geometric elevation 
angles (as without ray bending or obstruction), the geometric orbit 
is adjusted to the apparent position and shape that would be observed 
at the radio site. This apparent orbit is hereafter termed the refracted 
orbit. Subsequent sections describe the determination of azimuth zones 
to be avoided for prescribed beam—orbit separations and the permis- 
sible transmitted power. Appendices are included to: provide means for 
estimating initial ray angles from commonly available radio-relay in- 
formation when actual antenna angles are unknown; solve by manual 
calculation a representative numerical example, giving the applicable 
equations for each step; and verify governing equations using a different 
analytical approach. 


II, DETERMINATION OF REFRACTION FOR POSITIVE AND NEGATIVE ANTENNA 
ELEVATION ANGLES 


2.1 Ray Tracing Equations 


In the following equations 9, is the initial anglet of a ray as it leaves 
the earth’s surface and 7 is the total refractive bending corresponding 
to @,. Figure 3 illustrates this relationship and shows the geometric 
director with elevation angle ¢ to an intercept with the geostationary 
orbit at S. Also shown on an are through S centered at P is the apparent 
position of the intercept S, and a horizontal reference, zero-elevation 
point A. The latter relationships are used in a subsequent section to 
describe a method for constructing the refracted orbit. 

Angle ¢ is approximately, but in general not identical to, 4 — 7. 
However, this approximation is reasonable for rays between terrestrial 
antennas and geostationary satellites well beyond the earth’s atmosphere 
when relatively small effects of parallax associated with the controlling 
portion of the atmosphere near the earth’s surface are neglected.’ 

Figure 4 depicts a ray entering the earth’s atmosphere and the result- 


* The first derivative with respect to pointing azimuth (azimuth-elevation plot 
of the orbit) is more completely defined in Section V. 

+ 4. is generally used in ray-tracing equations to denote the initial ray elevation 
angle with respect to the local horizontal and is synonymous with a. used in 
subsequent sections to denote a radio-relay antenna beam elevation angle 
(namely, @ in Figs. 6). 

+ Calculations using equation (5) show that the actual ray angle with respect to 
the geocenter differs from that resulting from the assumption e = 6. — r by ap- 
proximately 1.5 minutes of arc for limiting conditions used herein. 
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ORBITAL PLANE IN EDGE VIEW 


Fig. 3— Refracted ray. 


ing path of the ray due to the refractive gradient. Depending upon the 
angle of arrival of the ray when it enters the atmosphere, the refraction 
causes it to intercept the earth, graze the earth, or become tangent to a 
unique earth-centered sphere at some height above the surface. If the 
ray does not intercept the earth it continues out into space again, being 
subjected to approximately the same refraction in exit as it encountered 
upon entering. 

At any point on the ray trace, the angle the ray makes with the local 
horizontal at that point is denoted by 6. The angle between the tangents 
to the ray at any two points (a, b) is a measure of the refractive bending 
between them and is denoted by r(a, b). 

The following presentation of refraction is based largely upon equa- 
tions given in Refs. 2, 6, and 7 for zero or positive initial ray angles and 
as interpreted by the authors for application to negative angles. Appro- 
priate uses of the equations and their application to the problem are 


7(3,1) T(1,2) 





Oo 


Fig. 4— Maximally refracted nongrazing ray. 
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explained, but the reader is directed to the references for complete 
derivations and limitations including sensitivities to the microwave 
frequencies involved. 

The radio refractive index n varies with atmospheric pressure, relative 
humidity, saturation vapor pressure and temperature. The lower limit 
for n is unity—no atmosphere, while the upper limit is determined by 
local climatic conditions. For the southeastern section of the United 
States, a typical range of n at mean sea level is 1.00025 to 1.0004. In 
equations involving refraction it is convenient to express refractivity 
as N where N = (n — 1) X 10° or, for this case, N values are 250 and 
400 (N units). The average decay of N is approximately exponential 
with height and the difference in N between the surface and a height 
of 1 km above the surface is given by” 


AN = —7.32 exp (0.005577N,). (1) 


The subscript s denotes N at the surface. The decay constant with 
height is expressed by” 


N, 
C, = In AN (2) 
N at any height h (kilometers above the radio site surface elevation) is” 
N, = NS exp [—C.(h oF h)I, (3) 


where h, is the surface elevation above mean sea level corresponding 
to N,. 

N, is sensitive to local elevations and hence, charts of N, for moun- 
tainous regions are difficult to use because the NV, contours are irregular 
and closely spaced. Therefore, obtaining an appropriate value of N, 
for use in equations (1), (2), and (3) for a particular site is often dif- 
ficult. However, charts are available giving N, reduced to mean sea 
level equivalents N, which, in effect, reduce the height-dependent N, 
values to a common base.’ Since charts of N,, are more easily interpreted 
and a single value of NV, usually applies over a large geographical area, 
they are used in this paper. 

N, and N,(h) are related® by N, = N, exp (—h/7). Conversely, for a 
given value of NV,, N, for surface height h, above mean sea level is 
determined from the expression 


N, = N, exp (=h/0); h, = 0. (4) 


The N, value obtained from equation (4) is used in equations (1), (2), 
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Fig. 5(a) — Radio refractivity conversion — N. to Ns (or Nz). 
Fig. 5(b) — Radio refractivity conversion — Nr to Nr. 


and (3). Figure 5a was generated using equation (4) and, for manual 
calculations, is convenient for determining J, . 

The conversion between NV, and N, above considers only the dry air 
effects of temperature and pressure related to the difference in elevation 
between mean sea level and the station surface height, whereas con- 
stants of the expression given for AN account for all terms contributing 
to a change in refractivity of the atmosphere with elevation above the 
surface height. 

A relationship exists between angle 0, at the point of origin of the ray 
and @ at any other point on the ray trace which is expressed by 


(1 +N, X 10°°)(a + h,) cos 6, 
=(1+N,X10°)(a+h)cos9=C, (5) 


where C is a constant and a is the earth’s radius at mean sea level (in 
kilometers). Thus, angle @ for any point on the ray trace is determined. 
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The incremental bending of the ray between closely spaced points 
on the trace is given by 


(1, 2):S -| oot 6. (6) 


As suggested by Schulkin, the term 1/7 is taken as unity with an error 
of less than 0.0001 in the computed refraction and, for an iterative 
solution, equation (6) is expressed as’ 


Ane 
r(1,2)=—| cot @dAn. (7) 
Any 


Shulkin also shows’ that equation (7) is approximated by (Anl — 
— /An2)/6, where Anis n — 1 and 6,, is (@1 + 62)/2. Hence the in- 
cremental bending between two closely spaced points on the ray trace 
is expressed by 


2(N, > N2) x 10°° 
0, + 9 


where NV, and N, , 6; and @, are the N values and ray angles (in radians), 
respectively, at the closely spaced points. 


r(1, 2) = rad., 0 < 6, < 10°, (8) 


2.2 A Method for Calculating Refraction 


The following paragraphs describe a procedure for calculating re- 
fraction curves of the form in Figs. 6 for any values of N, which is also 
applicable for the direct calculation of refraction corrections. 


2.2.1 Zero and Positive Initial Angles (+6,) 


First assume a value of N,, a station height h, , and an initial ray 
angle @, . Equation (4) is used to determine JN, for height h, and equa- 
tions (1), (2), and (8) to determine AN for a height h, where h = h, + 
Ah*. Equations (5) and (8) then give the ray angle 6 at the incremental 
height and the bending 7 in the first increment. For each successive 
increment of height, the previously solved-for values of N and @ become 
the initial values for equations (3), (5), and (8). The values of 7 for each 
iteration are accumulated to give the total bending for h, and @,. 
Repeating the above procedure with other values of @, results in data 
points to be used in plotting the refraction curve for the assumed sta- 
tion height h,. A complete repetition of the above, beginning with 


* Incremental heights must be small in the lower atmosphere where n changes 
rapidly but may increase at the higher elevations. However, for the generation of 
Fig. 6, a constant increment of 0.25 km was used to a height of 90 km. 
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Fig. 6 — Refraction versus antenna elevation angle for (a) No = 250, and (b) 
No. = 300. 


other station heights, results in a family of curves (namely, h,) for 
zero and positive initial ray angles. 


2.2.2 Negative Initial Angles (—9,) 


The calculation of refraction for negative initial ray angles requires 
a modification of the technique used for positive angles. Note in Fig. 4 
that the bending of the ray from T3(h; , —@3) to T1 is the same as that 
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Fig. 6 — Refraction versus antenna elevation angle for (c) No = 350, and (d) 


N, = 400. 
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of the ray from T1 (h, , 86, = 0) to T3. The latter is determined utilizing 
the equations for positive angles but in a slightly different manner. 

First, assume a grazing ray (h = 0, 6, = 0) and, using the iterative 
method previously described, compute the bending 7 and angle @ at 
each of the specific elevations h, desired for the chart, that is, in the case 
of Figs. 6:1, 2, and 3 km. The value of 7 for each height is then added to 
the maximum 7 determined for the same height (when 6, is zero as 
found in Section 2.2.1) to obtain the total bending for a ray with the 
initial angle, —@,. In Fig. 4 this is 7(T3, T1) plus 7(T1, 8S). Repeating 
the above but beginning with h = 1, 2, and so on, provides the data 
needed to extend the curves for positive initial angles into the negative 
range. 

Note in Figs. 6 that all refraction curves terminate on a dotted ex- 
tension of the zero-elevation ray representing a maximally refracted 
grazing ray. For a given height with an initial angle more negative than 
that represented by the point of termination, the ray is intercepted by 
the earth. 


2.2.3 Direct Machine Calculation 


For machine calculations it is desirable to compute the refractive 
bending directly without the use of refraction tables. For zero and 
positive initial ray angles, the calculation is straightforward as is des- 
eribed in Section 2.2.1. However, for a ray originating at a specific height 
with a negative initial angle @, , it is first necessary to determine the 
height of the ray where @ is zero (h, at Tl in Fig. 4). A variation of 
equation (5) permits this determination. If the right side of equation 
(5) represents point Tl in Fig. 4 and the left the initial station, say T3 
at height h; , then 


(1+ Nos X 107°)(a + hrs) cos 4, 
= (1 + Nw x 10°°)(a + hr) — Cc. (9) 


The variables for the left side of equation (9) are all determined and 
evaluation yields the constant C. Then, values are assumed for hy, 
beginning with zero, N;, determined, and the right side evaluated until 
the result is equal to C. After hy, is determined, 7(T3, T1) and 7(T1, 8) 
are calculated as described in Section 2.2.2 and added numerically 
to give the total refractive bending. However, when h7, is assumed to 
be O for the initial iteration and the right side is larger numerically than 
C’, the beam intercepts the earth and a solution with angle 9, is not 
possible. In such cases it is often desirable to determine the initial graz- 
ing-ray angle to the mean sea level horizon, which is accomplished by 
incrementing 6, upward until the equation is satisfied. 
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2.2.4 Angle from the Transmitting Site to the Radio Horizon 


The radio horizon, accounting for refraction and average local terrain, 
is determined by a variation of equation (9). Assuming that the height 
of a receiving station represents the average terrain height for some 
distance beyond, the initial angle ag (= 0,) of a ray which just grazes 
the terrain represents the angle to the radio horizon as seen from an 
elevated transmitting site. Letting the left side of equation (9) represent 
the transmitting location, the right side the receiving location (where @ 
is zero), and solving for ay yields 


a [A+ Ne X10) (a + ha) | 
Qy = COS E oe N- x 10°*)(a a hr) rad, hy = he » An < 0, 


(10) 


where the subscripts z and ,; refer to the receiving and transmitting 
stations, respectively. Nz is obtained by equation (4) and NV 7 by equa- 
tions (1), (2), and (3), substituting Np, and hz for N, and h,, and hp 
for h. (For manual calculations, Np and Ny may be determined from 
Figs. 5. First enter Fig. 5(a) with hz , N, , and read Nz from the ordinate. 
Then enter Fig. 5(b) with (hr — hz), Nz and read N ; from the ordinate.) 


2.2.5 Refractive Index Limits 


We now illustrate the importance of including the effects of refraction 
in orbital computations relating to terrestrial radio-relay systems. 
Table I (reflecting the use of equations and techniques discussed in 
subsequent sections) demonstrates parametrically the sensitivity to 
radio refractivity of the orbit-intercept pointing azimuth and the com- 
puted terrestrial transmitter power limitation. The 8 dB variation in 


Taste J—INFLUENCE OF REFRACTION 


Station Statistics Assumed Values 


Path azimuth 103.5 degrees from true north 
Station latitude 55.0° north 

| Station elevation mean sea level 
Antenna elevation angle a, 0 degrees 


Parametric results Computed values 


Radio refractivity N,, N units 0 250 400 
Geometric elevation angle, «., degrees 0 —0.555 —1.27 
Critical azimuth (from north), degrees 102.6 101.76 100.7 . 
Maximum transmitter power, dBW 47 50.7 55 
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the power shown in the table indicates clearly that refraction must be 
included in calculations and that limits for refractivity should be care- 
fully considered. 

World charts of N, in Ref. 8 indicate appropriate limits of N, are 
250 and 400. At specific locations and for short time intervals, the index 
may not fall within these limits. However, localized conditions will 
not affect large numbers of stations at any given time and a wider 
range of N, would unnecessarily broaden the restrictive zone for radio- 
relay systems. 

We suggest that the above limits be adopted for standardized cal- 
culations. For a specific case where an antenna pointing angle is close 
to the orbit and transmitter power limitations are restrictive, refractive 
limits applicable to that locale should be used. 


2.2.6 Adjustment of Computed Geometric Orbit Traces for Atmospheric 
Refraction 


For many solutions, particularly those involving graphical procedures, 
it is desirable to “elevate”? a computed geometric orbit trace to its 
apparent (refracted) position and shape. Such an adjustment yields a 
presentation permitting a given, arbitrarily-shaped radio beam power 
profile to be related unrefracted and hence undistorted to the easily 
obtained configuration of the refracted orbit. Figures 7 are charts to 
enable this manipulation, produced from Figs. 6 by plotting (a — 7.) 
versus 7,. A method for using these charts is given in Section VI. 


III. DETERMINATION OF THE POINTING AZIMUTH TO ORBIT INTERCEPT 


Recall that the elevation angle (with respect to local horizontal): of 
the geometric director shown in Fig. 3 is denoted by e. Hence, station 
geometric elevation angle e, may be replaced by a, — 7,, where a, 1S 
the initial antenna beam elevation angle and 7,, is the corresponding 
refraction correction. (A method for determining a, is given in Appendix 
A.) 

Note that available information for established radio-relay routes 
in the United States giving antenna elevations, path distances, and 
antenna elevation angles is generally expressed in units of feet, statute 
miles, and degrees, respectively; it is necessary to convert these quanti- 
ties into kilometers and radians for use in many of the expressions which 
follow. 

Inspection of Fig. 8 shows that the azimuth displacement from the 
meridian through a station located at P to an intercept with the geo- 
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stationary orbit is identical to angle A of spherical triangle PES’. 
From laws for right spherical triangles 


tan ¢ 
mee (11) 





cos A = 


where ¢ is the latitude at station P, and @ is the are equivalent of angleO 
of plane triangle OPS. Note that 8 is numerically equivalent to the 
maximum visible latitude for assumed radio-relay antenna beam eleva- 
tion angle a, and concomitant total ray-bending angle 7,,, . 

Angle 8 is determined from triangle OPS using the Law of Sines. 
This triangle is redrawn in Tig. 9, where 


? 


sin Q _ sin (7/2 + «,) 
a R 


Q = sin” (K™ cos «,), (12) 


I 


where e¢, is the station geometric elevation angle corresponding to a, , 
a is the earth radius, and R is the orbit radius, R/a = K. From inspec- 
tion, 8 = 7/2 — Q — e, . Substituting for Q yields 


8B = cos’ (K~* cos &,) — & . (13) 


Substituting equation (18) for 8 in equation (11) results in 


> a4 tan ¢ ; 
aes E [cos * (K~* cos €,) — 5| (14) 


IV. DETERMINATION OF RELATIVE LONGITUDE BETWEEN SITE AND ORBIT 
INTERCEPT 


The earth longitude displacement between radio-relay site P and 
suborbital intercept S’ in Fig. 8 is side \ of right spherical triangle 
PES’. From the Law of Sines 


sind _ sme 
sin A : 





from which: 
\ = sin™’ (sin A sin 8), (15) 
where 8 and A are found by equations (13) and (14). Note that when 


A = x/2, corresponding to g = 0, the maximum visible longitude 
displacement is 8. 
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Fig. 7— Refraction versus geometric elevation angle for (a) No = 250, and 


(b) No = 300. 


V. GEOMETRIC ORBIT TRACE—CORRECTED FOR ANTENNA ELEVATION AND 
ATMOSPHERIC REFRACTION 


' The geostationary orbit and earth’s equator are coplanar; hence the 
orbit near the horizon normally appears to be tilted with respect to the 
local horizontal plane. Were an equatorial orbit sufficiently distant, 
as in celestial observations, the angle of tilt would equal the colatitude 
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Fig. 7— Refraction versus geometric elevation angle for (c) No = 350, and 
(d) No — 400. 
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of the observer’s position. However, the maximum visibility circle in 
Fig. 8 is not a great circle, so the angle of tilt is less than the colatitude 
except where the latter circle crosses the equator. In the following para- 
graphs, an expression is developed for the tilt of the geostationary orbit 
as if viewed from a unique location on the visibility circle. This tilt is 
defined hereafter as the apparent slope of the orbit trace (or first deriva- 


N 


GEOSTATIONARY ORBIT 





to 


Fig. 8— Geometry relating site P and geostationary orbit intercept. 


ANTENNA POINTING , = 3405. 


£0 


p 
~ 


R 


Fig. 9— Geometry of intercept-determining triangle. 


tive of the orbit trace, in elevation, with respect to direction, in azi- 
muth).* 

The size of the visibility circle in Fig. 8 depends upon a, and Ta. ; 
for given site latitude ¢, the angular displacement in earth longitude \ 
between site P and point S where the refracted beam intercepts the 
geostationary orbit (longitude of suborbital point 8’) also depends upon: 
a, and rt... Hence, the slope 6 of the geometric orbit trace constructed 
in Fig. 11 also depends upon a, and r,, . 

Note in Fig. 8 that the angle between orbit tangent t, constructed at 
S and the local vertical plane at P through § is angle ¢ of spherical 
triangle PES’. From the Law of Sines 

¢ = sin” em ey, (16) 


sin 6 





The complementary angle between orbit tangent t, and the plane of a 
circle generated by radius CS (perpendicular to OP) is denoted by 
6’ = 1/2 — ¢. Substituting equation (16) for ¢ yields 


peat et. (SID a) i 

6 cos (ame B (17) 
Angle 6’ is viewed in true magnitude from O (or 8’), but is seen from 
radio-relay site P as a smaller angle 6 when rotated through angle Q 


as shown in Fig. 10. Note that tan 6’ = y/x and tan 6 = (y cos Q)/2, 
from which 


= tan” (tan 6’ cos Q). (18) 

Combining equations (17) and (18) yields 
= tan” {tan [cos * (sin ¢/sin 8)] cos Q}. (19) 
~* The orbit trace is envisioned as the locus of all pointing angles to the orbit, 


plotted on an azimuth-elevation chart aligned and calibrated. according to the 
location of the observer. 
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Fig. 10 — Development of geometric orbit slope as observed from site P. 


VI. CONSTRUCTION OF THE REFRACTED ORBIT 


Figure 11 presents the geometry of the problem viewed from a radio- 
relay station (similar, but not equivalent to the presentation given in 
Ref. 1). Reference to Figs. 1 and 3 may assist in interpretation of Fig. 11, 
wherein points A, S, and §, correspond to similar points in Fig. 3 and 
intercept S corresponds to the left (easterly) intercept S in Fig. 1. Origin 
A is the beam-intercept direction from site P calculated from equation 
(14), accounting for atmospheric refraction. 

Since the orbit is tilted with respect to the local horizontal plane at 
site P, the elevation angle of the geometric director to the orbit con- 
tinuously varies as the orbit is scanned in azimuth. The refractive bend- 
ing of a ray is a function of the geometric angle, so that the position of 
the apparent, or refracted orbit, is above the geometric orbit and it 
exhibits a constantly changing slope with respect to the latter. 

The bent, refracted orbit is shown through point §, (apparent posi- 
tion of interception point § with refraction; also, the antenna elevation 
angle a, at the azimuth origin). The straight line labeled ‘geometric 
orbit”’ is tangent at S (Fig. 8) to a radial projection of the geostationary 
orbit on a sphere of radius PS centered at P. The horizontal line shown 
through §S also represents an arc, in edge view of a great circle through 
S, parallel at S to the local horizontal plane at P, on this same sphere. 
Hence, the angle 6 obtained from equation (19), corrected for refraction 
while retaining the concept of a constant site latitude ¢, is also accurately 
represented by the apparent slope of the plane-figure geometric orbit 
trace at the azimuth origin. Figure 11 illustrates the construction of 
the corresponding refracted orbit trace. 

The equation of the linear geometric orbit trace through § is 


e = —tan (5)AA + a — Tao; (20) 
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where ¢’ is the elevation angle of the geometric orbit trace corresponding 
to an arbitrary displacement in azimuth AA from origin A. 

Figs. 7 are entered with values of ¢’ derived from equation (20) to 
obtain total refractive ray-bending angles r,.. The refracted orbit 
trace in Fig. 11 is then constructed by plotting points with coordinates 
(A + AA, a’) and connecting these with a smooth curve, where 


a =i tre. (21) 


VII. AZIMUTH DISPLACEMENT FROM INTERCEPT TO KEEP THE BEAM CENTER 
AN ANGULAR SEPARATION ¥ FROM (AND BELOW) THE MINIMALLY 
REFRACTED ORBIT (NV, MINIMUM) 


Figure 11 also illustrates a solution to keeping an angular separation 
v between the center of a circular beam and the geostationary orbit. 
The circle centered on S, and labeled “unrefracted beam’’ is a cross- 
section of a conical figure of revolution with apex at antenna site P 
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Fig. 11— Orbit geometry as observed from site P. 
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and 2v included angle (locus of all rays having angle v with respect to 
the beam center). If elevation angle a, of the antenna is fixed by radio- 
relay path parameters, the orbit can be avoided only by an azimuth 
displacement of the beam. The required angular separation v between 
the orbit and the beam center results when the latter is moved in azi- 
muth away from intercept until the unrefracted cone is just tangent 
to the refracted orbit trace. Note that an identical relationship exists 
between the elongated ‘“‘refracted beam” and the geometric orbit trace. 
However, since the former presentation is easier to develop and can 
readily accommodate unsymmetrical beam cross-sections, it is pre- 
ferred for following analyses. 

Figure 12 represents the analytical solution of an illustrative problem 
given in Appendix B and is helpful for visualizing subsequent pro- 
cedures. From inspection 


Bete (en tl) a (22) 


M 


where a’ is any point on the refracted orbit corresponding to arbitrary 
displacement JZ from an azimuth intercept. Note that in the range of 
interest M is negative with respect to azimuth A,,;, , obtained from equa~- 
tion (14) using e, = a, — Tyo. Within a few degrees of intercept, the 
geometric orbit is approximated by a line of constant slope 6. Recall that 
adjustment of the geometric orbit for refraction results in a refracted 
orbit trace having a constantly changing slope with azimuth displace- 
ment from the intercept. This displacement for a given elevation of the 
refracted orbit is found by solving equation (22) for M: 


— (al = ta) = (@o = Tao), 
ee tan 6 (23) 


Since the refracted orbit slope varies with elevation angle, no direct 
mathematical solution exists for determining that azimuth displacement 
providing an angular separation v between the beam center and the 
refracted orbit, measured in a direction normal to the latter. However, 
it is closely approximated by determining the slope of an orbit segment 
including the region of interest. Figure 12 suggests that the refracted 
orbit slope is everywhere less than the geometric slope. Hence, two 
judiciously chosen points on the refracted orbit having elevations 


al = a, + v cos 6, a2 =a, +2, 


always bracket the appropriate segment. Figures 6 and 12 also show 
that at these elevation angles the differential refraction is small. This 
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Fig. 12 — Graphical representation of the analytical solution. 
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supports the assumption that the slope of the small orbit segment in 
this region is virtually constant. 

The refraction corresponding to elevation angles a, , al, and a2 are 
calculated or determined from the charts. Substituting a,, r.., al, 
and 7.1 in equation (23) gives the azimuth displacement M1 corres- 
ponding to al. Similarly, 1/2 is determined by substituting a, , Tao; 
a2, and ra2. Slope 61 between the two chosen points on the refracted 
orbit is given by 


tan 61 = (a2 — al)/(M2 — M1), 
resulting in 
61 = tan” [v(1 — cos 6)/AM], (24) 


where AM = M2 — M1. 

Figure 12 shows that azimuth displacement AA necessary to keep 
the beam center an angular separation v from the refracted orbit is 
S.c (equal to S,o plus oc).* By inspection, oc = v/sin 61, od = v/tan 61 
and S,o = M2 — od. Assembling these into an equation for AA yields 


AAnin = M2 — v/tan 61 + v/sin 61. (25) 


7.1 Special Case 


If a ray with initial angle a, intercepts the earth, a solution is obtained 
by calculating angle a, to the radio horizon as is described in Section II. 
Then, substituting e, for e, in equation (14), the orbit intercept for a 
grazing ray is determined. When solving for the necessary azimuth 
displacement from this intercept, a, and 7,7 are substituted for a, and 
Teo in equation (23). Note, however, that in determining a1 and a2 for 
substitution in equation (23), the use of ray angle a, remains valid. 


VIII. AZIMUTH DISPLACEMENT FROM INTERCEPT TO KEEP THE BEAM 
CENTER AN ANGULAR SEPARATION Y FROM (AND ABOVE) THE MAXI- 
MALLY REFRACTED ORBIT (VN, MAXIMUM) 


The left side of Fig. 12 shows that the refracted orbit for the case 
of maximum refraction falls below the radio horizon with azimuth 
displacement from point A,,,x obtained from equation (14) using ey = 
Qy — Tan- Since the earth intercepts all rays below the horizon, they 
cannot affect the orbit and it is only necessary to displace the beam 


ia * Notations such as S.c represent scalar distances between indicated points in 
ig. 12. 
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center in azimuth sufficiently to maintain angular separation v from the 
horizon intercept. The required displacement is 


AAmz = [v? — (a — an)’]*. (26) 


In Fig. 12, ay(—0.25°) is the initial angle of a ray which grazes at a 
recelving station height of 0.4 km and originates at an assumed trans- 
mitting height of 0.5 km. 

As demonstrated in Appendix B, little increase in the critical zone 
results if it is assumed that the angle to the radio horizon is equal to 
the initial ray angle. Therefore, for manual calculations involving small 
positive values of a, , these angles are assumed identical so that AA 
is simply an angular separation v from the orbit intercept determined 
for a,. For values of a, more negative than that for a grazing ray 
(determined from Figs. 6 or by calculation), it is necessary to determine 
a, and the corresponding orbit intercept using equations (10) and 
(14). Then equation (26) gives the necessary azimuth displacement. 


1X. DETERMINATION OF THE CRITICAL ZONES 
9.1 Critical Zones Defined 


The critical zones to be avoided at radio-relay transmitting sites to 
protect the geostationary orbit are defined: 


Zerit = Arias + BRA ves to Anais =, DAG 
(degrees from South), (27) 


where values for A and AA for maximum and minimum refraction are 
obtained as in Sections III, VII, and VIII. These zones are converted 
to azimuth zones with respect to true north by subtracting the bound- 
aries from 180 degrees for the easterly zone and adding them to 180 
degrees for the westerly zone. . 

Calculations for stations in southern latitudes are identical except 
that they are referenced to north rather than south. The easterly azi- 
muth zone with respect to true north is obtained directly from equation 
(27), while the zone boundaries are subtracted from 360 degrees for the 
westerly zone. 


9.2 Special Case 


At latitudes exceeding ¢ = cos’ (K™ cos Y) — W it is impossible 
for the beam’s center ray to be below the orbit with angular separation 
v.* Hence, for such extreme northern latitudes, a single critical zone 


* Derivation of this equation is given in Appendix C. 


3412 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1969 


spans south with both easterly and westerly boundaries defined by 
Zerit = Amax + AAmax (degrees from south). (28) 


X. DETERMINATION OF MAXIMUM PERMISSIBLE RADIATED POWER 


As mentioned in Section I, international agreements exist for maxi- 
mum. radiated powers.” For 6-GHz radio-relay transmitters whose 
antennas point within v = 2° of the geostationary orbit, the power 
limitation for separations less than 0.5° is 47 dBW relative to the iso- 
tropic case (EIRP), increasing 8 dB per degree to a maximum of 55 dBW 
(occurring at 1.5°). This limitation refers specifically to the center of 
the major lobe. For this case, only the relationship between the refracted 
center ray of the beam and the geometric orbit is considered. 

If an existing or proposed path direction is between the critical values 
computed as in Section III for maximum and minimum refraction, the 
refracted center ray is likely to intercept the orbit for appreciable periods 
of time. For such cases the maximum power is 47 dBW. 

If the path direction for a system in the northern hemisphere is within 
the critical zone but nearer to south (smaller) than A,;,, the actual 
separation p illustrated in Fig. 12 is 


p = (Amin — Ap) sin 6, (29) 


where A, is the path direction measured from true south. 

Conversely, if A, exceeds Axx, the separation of the beam center 
from intercept of the geometric orbit and the refracted horizon (Fig. 12) 
is 


p= (A, = Armax) =r | €y — €o 3, (30) 
where 
€y = GAH ~ TaH 3 Eo = Ao —~ Tao « 


Note that ¢«, in equation (30) represents maximum atmospheric refrac- 
tion. 

If a ray having initial angle a, intercepts the earth, e, for use in equa- 
tion (30) is indeterminate. For such special cases, a conservative ap- 
proximation for the angular separation is p = A, — Armax - 

The maximum permissible effective radiated power P, dBW (EIRP) 
is given for separation p according to the following criteria: 


pS 05°, .P, = 47; 
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05° <p <1.5°,  P, = 8(p — 0.5) +47; (31) 
p>15°, P, = 55. 


XI. CONCLUSIONS 


A direct analytical method involving few approximations and assump- 
tions can be used by system planners for calculating refraction-cor- 
rected ranges of pointing azimuth for microwave radio-relay antennas 
within which significant interference with geostationary communication 
satellites can be expected. Required angular separations between the 
refracted beam and the geostationary orbit are translated into required 
azimuth displacements of a radio-relay antenna from that calculated 
for orbit intercept; conversely, for cases where exposure is unavoidable, 
means for determining the maximum transmitted powers permitted 
by international agreement are presented. 

Since all analytical expressions including refraction corrections are 
readily amenable to machine calculation, both speed and improved 
accuracy in estimating the pointing azimuths are possible. The sug- 
gested refractive index limits are believed to be representative for the 
large majority of exposures and useful for a standardized approach to 
the problem. For more general applications, where refractive index 
variations are known to be different, one may use the same principles 
to generate his own applicable correction curves. 
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APPENDIX A 


Estimation of Antenna Elevation Angles 


The initial beam elevation angle for a radio-relay antenna is deter- 
mined by the geometry of transmitting and receiving locations, the path 
length, and refraction. The final alignment based upon transmission 
measurements, if recorded, is preferred for these calculations. The 
antenna elevation angle for proposed radio-relay paths can be estimated 
using the method given below with sufficient accuracy. — 
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Figure 13 depicts radio-relay transmitter T and receiver R at eleva- 
tions OT and OR above geocenter O, assuming a spherical earth of radius 
ka. Coefficient k is the ratio of apparent earth radius to true earth radius 
and accounts for refraction in the lower atmosphere.” The path length 
is represented by are D (great circle length at mean sea level). The trans- 
mitting and receiving antenna heights with respect to mean sea level 
are denoted by hy and hz , respectively. 

Inspecting Fig. 18, 6 = D/ka radians, C’ = 2(ka + hr) sin (6/2), 
and a, = EH — ¢/2 radians. From triangle TT’R and laws for plane 
triangles 


(he — hr) sin (r/2 — $/2) 


tant = GE ig = hn) 608 (/2 = 6/8) - 
It can also be shown that 
a, = tan™ | ea ta = be —___| — D/2ka radians, (33) 
tan (D/2ka) X (2ka + he + hr) 


where hz , hy , D, and a are expressed in the same units. 
Reference 2 provides a formula and a table relating k and N,. For 
most calculations, a value of k = 4$ results in sufficient accuracy”® (the 





MEAN SEA 
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Fig. 13 — Geometry of antenna elevation angle. 
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geometries of radio-relay systems within limits of normal site elevations 
require antenna elevation angles which are relatively insensitive to 
values chosen for k). 


APPENDIX B 


Illustrative Calculation 
Problem Input for a Particular Radio-Relay Site: 


Latitude ¢ — 88°N, 
Transmitter Height h-— 0.5 km, 
Receiver Height hz — 0.4 km, 
Path Length D — 28 km, 
Path Azimuth A, — 97.75° with respect to true north, equivalent 
to 82.25° from south towards east, 
N, Limits — 250 and 400 N units. 


B.1 Antenna Elevation Angle 
From equation (33) and using k = 3 
oe 0.4 — 0.5 
ue Sx TW SO OTL | 
28 
— 360 x 6873 
— 0.00523 rad, which converts to —0.3°. 


I 


B.2 Geometric Elevation Angle for N, = 250 
From Fig. 6a 
éo = A — Tao = —0.3 — 0.59 = —0.89°. 
B.3 Azimuth Intercept for N, = 250 
From equations (13) and (14) 
8 = cos [0.1509 cos (0.89°)] + 0.89 = 82.2°, 
Amin = cos [tan (38°)/tan (82.2°)] = 83.86° from south. 
B.4 Orbit Slope for N, = 250 | 
From equations (12) and (19) 
Q = sin” [0.1509 cos (0.89°)] = 8.68°, 
6 = tan™’ [tan {cos™’ [sin (88°)/sin (82.2°)]} cos (8.68°)] = 51.26°. 
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B.5 Azimuth Displacement for v = 2° below the Orbit 
From equations (23), (24), (25), and Fig. 6(a) 


Qo = —0.3°, Tao = 0.59°, 

al = —0.3 + 2 cos (51.26°) = 0.95°, Tar = 0.35°, 

a2 = —0.3+ 2 = 1.7°, Ta2 = 0.28°, 
M1 = [0.95 — 0.35) — (—0.3 — 0.59)] + tan (51.26°) = 1.19°, 


M2 = [(1.7 — 0.28) — (—0.3 — 0.59)] -+ tan (51.26°) = 1.85°, 
51 = tan” {2[1 — cos (51.26°)]/0.66} = 48.62°, 
AAnin = 1.85 — 2/tan (48.62°) + 2/sin (48.62°) 


2.76° toward south from intercept. 


; 
cll 


B.6 Horizon Intercept oF N, = 400 
From equations (1), (2), (3), (4), (10), (12), (18), (14), and Fig. 6(d) 
Nez = 400 exp (—0.4/7) = 377.78 or from Fig. 5(a), 
AN = —7.32 exp (0.005577 X 377.78) = —60.2, 
C, = In [877.78/(877.78 — 60.2)] = 0.17, 
Ny = 877.78 exp [—0.17(0.5 — 0.4)] = 371.28 or from Fig. 5 (b), 


opie | 1.000877 6373.4 
°8  £1.000371 ** 6373.5 


Il 





ay 


| = —0.25°, 


Tan = 1.22°, 

éy = —0.25.— 1.22 = —1.47°, 

68 = cos’ [0.1509 cos (1.47°)] + 1.47 = 82.79°, 
Ama = cos’ [tan (38°)/tan (82.79°)] = 84.33° from south. 

B.7 Azimuth Displacement for v = 2° from Horizon Intercept 
From equation (26) 
A Ame = (2)? = (0.8 + 0.25)"}) 
= 1.99° toward north from intercept. 


B.8 Critical Zone 


From equation (27) 
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Zerit = 84.83 + 1.99 to 83.86 — 2.76 
= 86.3° to 81.1° from south. 


B.9 Azimuthal Zones 
True-north azimuths: 
93.7° to 98.9° easterly, and 
266.3° to 261.1° westerly. 
The path azimuth of 97.75° falls within the easterly azimuthal zone. 
B.10 Relative Longitude to Suborbit Intercepts 
From equation (15) 
Amin = Sin” [sin (83.86°) sin (82.2°)] = 80.04°, 
\max = sin’ [sin (84.33°) sin (82.79°)] = 80.82°. 


B.11 Maximum Permissible Radiated Power 


Comparing the path direction of 82.25° from south with the critical 
zone found in Section B.8 and with A,,;, and A,,,x found in Sections B.3 
and B.6 reveals that equations (29) and (81) are appropriate for calculat- 
ing the angular separation and permissible power: 


= (88.86 — 82.25) sin (51.26°) = 1.28°, 
P, = 8(1.23 — 0.5) + 47 = 52.8dBW at 6 GHz. 


B.11.1 Alternate Power Calculation 


An alternate calculation is indicated when the path direction is further 
from south than A,,x. Assume that A, is, instead, 85.25° from south 
(left side of Fig. 12). The appropriate equations are now (80) and (81). 
A value of e, for maximum refraction is required for equation 0) 

From Section B.5, Fig. 6(d), equations (80) and (31) 


a, = —0.38°, 
ee Mes ag 
e, = —1.6°, 


p = [(85.25 — 84.33)? + (—1.47 + 1.6)’]? = 0.93°, 
P, = 8(0.93 — 0.5) + 47 = 50.4 dBW at 6 GHz. 
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APPENDIX C 


Alternate Derivations of Basic Equations and Critical Latitudes 

All following relationships are expressed in terms of latitude ¢ of radio- 
relay site P, shown in Fig. 14 (a constant), and the maximum latitude 
for which the refracted geostationary orbit is visible to the antenna 
(a constant elevation angle). 
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Fig. 14— Angle ¢; between orbit tangent and local vertical plane O-S-Py. 
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c.1 Latitude Limit of Visibility for the Geostationary Orbit 


Figure 14 shows that the maximum latitude visible to the geosta- 
tionary orbit is 


max = cos [K7 cos (a, — Tao) — &@ — Tao (34) 


B, 


ll 


where 


a, = known elevation angle of the radio-relay antenna, 

Tao = corresponding total ray-bending angle due to refraction in- 
ferred from CRPL Exponential Reference Atmosphere’ or from Figs. 6 
of text, and 

K = ratio of orbit radius R to assumed earth radius a. 


Hence, equation (34) is a restatement of equation (18). 


c.2 Pointing Azimuth to Orbit Intercept 


The larger of angles formed by intersection of site latitude circle ¢ 
and the visibility circle corrected for antenna elevation and refraction 
shown in Fig. 14 is a direct measure of the pointing azimuth to be avoided 
for a station located at that intersection (P,). Hence, angle Ay between 
tangents T, and T, is the supplement of pointing angle A given by 
equation (14) referred to true south. From the geometry of Fig. 14 


sin a) 


Ay = cos” {—sin ian (oo ase) 


cos B 
r/2<|Av| <7. (85) 


Equation (85) yields two pointing azimuths which are of interest; 
one in the second quadrant referred to true north, corresponding to a 
westerly direction for stations in the northern hemisphere—and one 
in the third quadrant, or easterly direction. Reversed directions result 
for stations in the southern latitudes. 

Figures 15 illustrate changes of variables which simplify the demon- 
stration of equivalence between equation (85) and equation (14). Since 
cos’ 6 = 1 — sin’ 6B and cos’ 9 = 1 — sin’ g, 


sin a) 


Ay = cos” {sin Es (Ha. Se) 
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ees | tan (Leos = eos’ |), 
cos 8 


Substituting » and » (Figs. 15) and reducing the result yields 
Ay = cos’ (—sin p COS »). 
Now, substituting for » and v provides 
Ay = cos” [—(in ¢/sin 6)(cos B/cos ¢)] 


tan 9 


ese (-= e) : 


which is exactly the supplement of the angle obtained from equation (11) 
in the text. 
c.3 Longitude Displacement of Orbit Intercept 


The earth-longitude displacement between radio-relay site P, and 
suborbital intercept 8’ in Fig. 14 is also inferred from equation (35): 


eg 2 p\k 
y= tan” | L0 — sin’ 2 — cos? 8] O0<|A\| < 2/2, 


cos B 
= sin’ (sin A sin ), (36) 


from which first and fourth quadrant longitude adjustments are referred 
to the suborbital longitude. The equivalence with equation (15) is 
demonstrated using techniques indicated above and identifying angle 6 
uniquely with the maximum latitude for visibility ¢,,., (Section C.1). 


c.4 Geometric Orbit Trace 


Since the size of the visibility circle in Fig. 14 depends upon a, and 
Tao, the earth-longitude displacement \ between P, and S (same 
longitude as suborbital point 8S’) also depends upon a, and 7,, . Because 


BSS: et 


{ 
SIN @ | (cos? p~cos2a)| 


|(sin2g—sin2¢)3| cos B 


(a) (b) 


Fig. 15 — Change of variables: (a) angle w and (b) angle pv. 
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the resulting viewing aspect depends upon these parameters, the slope 
of the geometrical orbit trace constructed in Fig. 11 also depends upon 
a, and rT, as well as ¢. 

Note that the angle between orbit tangent t constructed at S and the 
local vertical plane at P, through S is denoted by ¢,. This angle is 
observed undistorted at point 8’, but appears to be a slightly enlarged 
angle ¢/ for an observer at P, (as if point S were visible without ray 
bending): 


¢! = tan” (tan ¢,/cos Q). (37) 


Hence, complementary angle 6 between orbit tangent t and a line 
through S perpendicular to the geometrical line-of-sight SP, and parallel 
to the horizontal plane at P, is also the slope of the corrected geometric 


orbit trace shown in Fig. 11, 
5 = cot’ (tan ¢,/cos Q) 


he { tan [sin7 (sin ¢/sin 6)] ! 
7 cos {tan™* [sin B/(K — cos £)j} 


= tan™’ {tan [cos”’ (sin ¢/sin 8)] cos 2}, (38) 


for which equivalence to equation (19) is shown using the techniques 
incorporated in Sections C.2 and C.3. 


c.6 Maximum Latitude Permitting Angular Separation v Below the Orbit 


A maximum latitude exists for each antenna elevation angle allowing 
the beam-center ray to be below the refracted orbit with prescribed 
angular separation v. Figure 16 illustrates the determination of this 
critical latitude. S, is a point on the geostationary orbit having zero 
relative longitude with respect to station site P,. A ray emanating from 
the antenna with initial vertical angle (7/2 + a, + v) must just inter- 
cept the orbit. Accounting for refraction, the angle between the geo- 


N 





Fig. 16— Geometry determining the critical latitude. 
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metric director of the elevated ray and the local horizontal is V = 
[a, + — T.4,4)]- Site latitude v and W are related by triangle OP.S, . 
From the Law of Sines 
sin (7/2 +) _ sin Q 
R os ONG 


Letting R/a = K, 


Q = sin’ (K™ cos W). 
SinceQ + 9 + 7/2+ V7 = 7, 
oe = cos (K7* cos W) — W. (39) 
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An Extended Correlation Function of 
Two Random Variables Applied to 


Mobile Radio Transmission 


By W. C.-Y. LEE 
(Manuscript received June 16, 1969) 


The definition, properties, and application of an extended correlation 
function of two random variables involving two common parameters are 
described and applied to mobile radio systems. The correlation functions of 
a predetection diversity combined signal (using a scheme of phase equalizing 
by multiple heterodyning) and of a directional antenna array signal are 
derived with the help of the extended correlation function. 

These correlation functions can be used to determine parameter values 
giving minimum correlation between two signals desirable for diversity 
systems. One can also obtain the power spectra by taking the Fourier trans- 
form of these correlation functions. Thus extended correlation functions 
promise to be useful. 


I. INTRODUCTION 


If two random variables depend on only one common parameter, 
such as time or distance, conventional correlation formula can be ap- 
plied to the two variables. However, if both of these variables involve 
not one but two common variable parameters, then the correlation 
formula found in the current literature is limited.* Since such cases occur 
in some of our mobile radio problems, as we discuss later, we need to 
define an extended correlation function and outline its properties and 
applications. 


II. DERIVATION OF AN EXTENDED CORRELATION FUNCTION OF TWO 
RANDOM VARIABLES INVOLVING TWO COMMON PARAMETERS 


A conventional normalized correlation function of two random 
variables r, and r,, both of which are functions of one parameter 


* Prior to acceptance of this paper for publication, the author was advised that 
a similar concept was discovered independently by A. Papoulis in his recently 
published book. 
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d{that is, r:(d,) and r.(d2)] can be expressed as” 
Ri(d, , dz) — mm, 


0102 
a (r1(dy)r'o(de)) av = (71(d1))uv(72(de) )av eee ch) 
(ri (di))av is (ry (dy) yev)? * [(7'3(do))av a (ro(d2))evl? 
where m’s are the mean values, o”’s are the covariances, pi2(d; , de) is 
in the range 0 S | pio(d,, d2) | S 1, and Ris(d, , de) = (ri(di)ro(de))av 
is the correlation function.* 

Supposing a random variable r,(D, ; d,) is a function of two parameters 
D, and d, , and another variable 7,(D, ; d.) is a function of two param- 
eters D, and d,; the normalized correlation functions of these two 
variables can be deduced from equation (1): 

pi2(D, , Dz 5 dy , de) 

= Rix(D, , Dz 3d, , dz) — mymz 
0199 
=3 = (r(D, ’ d,)ro( D2 » ds) dav = (ri (D ’ dy) )av(Po( Do ’ d»)) aw many 
[ri (Dy ie — (r(Di » dy) av) [rs(De , de) av — (r(D» ) ds))a0 i 
(2) 
If the problem we are dealing with is a stationary random process for 
both of the parameters D and d, then 
RD, , Dz 5 dy , dz) = Ri(Di — D2 3d, — de), 
(r,(D, , dx) )av fad (r,(0, 0)) av == Mr ) 
(rg(D, ’ di))av = Mi, = (r(0, 0)) av = mi, = a; ; 
where k = 1, 2. Now m, and o, are constants and we may let D = D, — 
D, and d = d, — d,. Then equation (2) becomes 
Ri2(D; d) — MM | 


0102 


py2(d, , dy) = 








Pi2(D; d) = (3) 


We call p,;2(D; d) a normalized extended correlation function of the first 
kind. Also we note that p,.(D; d) in equation (8) is always smaller than 
p12(0; 0) which is equal to one: 


pi2(D; d) S pi2(0; 0) = 1. 


+ The terms “correlation function Riz(d:, dz) and normalized correlation func- 
tion pie(d1, de)” are adopted from Ref. 3, p. 59. 
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When the difference D is equal to zero, then 
p12(0; d) = pi2(d). 

Now we should illustrate and extend equation (3). We are going to 
find an extended correlation function of the first kind from a function 
e(D; d, , dz, d3, +++ , dm) Where all the a’s are function of D and another 
parameter a [that is, d;(D, «) for 7 = 1, ml], then 

RAD; d, , de , ds ed » dm) 

= (e[0; d,(0, 0), d,(0, 0), d3(0, 0), oe ) 
-e(D; d,(D, a), d,(D, a), pe d,,(D, )})avs 


and the normalized correlation function can be derived from equation 
(2) as 


— RD dy, +++ 5 dy) = me 


T-~ 


p(D; d, y Oey ees 7 in) 


where 
me = (€[0; d,(0, 0), do, 0), ds(0, 0), ++ , dn(O, 0)])av 
o = (e[0; d,(0, 0), d(0, 0), d3(0, 0), oe 3 d,,(0, 0)])av = m; . 


If we consider the case d;(D, «) is a constant for all D and a itself is 


a constant, then we may assign a new symbol R.(D|d,, dz, «++ , dn) 
which can be expressed as 
R(D|d,, de, a jn) 

_ (e(0; di , do, ees »Amn)e(D; di , de , ds , ats pac) ey 4 
R(D|d,,d2,d3, +--+: , dm) is a correlation function under a condition 
that all d, , d.,d3,--- , dm are constants. The normalized correlation is 

R(D|d,,d2,-**,dn) — mM: 
p(D | dy ,d,, ++) = PLA edo mew) 


where m, and o, have been defined previously. We call equation (4) 
the normalized correlation function of the second kind. As we will show 
in the Section ITI, the extended correlation function of first kind p:.(D; d) 
and the extended correlation function of second kind p,.(D | d) can be 
used to obtain the correlation of signals from two diversity scheme re- 
ceivers easily. 

In order to give physical meaning to these functions, let us consider 
the following two cases. Suppose that two base-station multibranch 
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diversity receiver arrays are separated by a distance D. The antenna 
spacing between branch-antenna elements for the first array is d, and 
for the second array is d,. Both receivers simultaneously receive the 
signal from a distant mobile radio unit. We would like to determine 
the values of d,, d., and D to obtain the least cross-correlation de- 
sirable for the best diversity reception of these two received signals. 
The extended correlation of first kind p,.(D; d;, d.) may be used in 
this case. 

The second case assumes that a mobile radio multi-branch diversity 
receiver array, with given uniform antenna element spacing d, moves 
along the street with a constant speed V. The autocorrelation of a 
signal e, received by the mobile receiver, can be obtained from the ex- 
tended correlation function of the second kind p,(D | d). Alternatively, 
we can also consider a multielement directive antenna instead of the 
diversity scheme. In this paper, we only treat the latter case. The 
former case can be solved following the same technique. 


IiI. APPLICATION TO MOBILE RADIO PROBLEMS 


3.1 Derivation of the Correlation Function of a Signal Received from a 
Predetection Diversity Combining Receiver 


A multichannel predetection diversity combining system is a scheme 
for bringing a number of RF carriers to a common phase by means of 
multiple heterodyning. Then a linear combiner at the IF frequency is 
used to sum the individual channels.*'* A signal received from this system 
is called a predetection diversity combined signal. 

Suppose that a signal consisting of multipath vertically polarized 
waves is received by an J/-branch predetection combining mobile re- 
ceiver with a M-antenna space diversity array. The M-antennas are 
spaced by d, , d., --- dy respectively from an arbitrary common point. 
After the array has moved a distance D, the received signal e, which is 
the sum of the M individual signal amplitudes received from M indi- 
vidual antennas, can be expressed as°”’ 


e(D; d, , dz, d3 ,d,, °°: , dy) = 7,(D; d,) + r2(D;3 dz) + 13(D; ds) 
ea + rar(D; daz) 


> t'm(D; dm), (5) 


where all r,, are functions of distance D and antenna spacing d,, (see 
Appendix A). For a mobile radio signal,’* or a long range fading signal,” 
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the 7, are usually Rayleigh distributed. Suppose that all d’s are con- 
stants, then the autocorrelation function of the signal given in equation 
(5) as a function of the separation distance D is an extended autocorre- 
lation function of second kind which can be expressed as 


RAD | dy , de , ds , da , ++) 
= (e(0; a, , dz , dz , dy ) -++)e(D; d, , de ile 9a) ae 


= ([$no:e3][ S020) 


(6) 
AL M 
= (>) Dorn; 4,)7,(D; ’.)) 
m=1 n=1 av 
M MM 
= Ss. Red, =x d,). 
m=] n=1 
Using equation (3), this can also be written 
R(D|d, -++ dw) = p.(D| di, ++: ,da)(ot) +m, (7) 
where 
a. = (e'(0; d, greene , dxr)) = m ’ (8) 
M 2 
(e'(0; dy pt eg dat))av = (b> Tn(O; d.)) ) 
1 av 
M M 
= >) Dd (a0; d.)ra(O; d.))es 
M M 
= >> DR..(0; dn — d,). (9) 


Substituting equation (8) into equation (7), and combining equations 
(6) and (7), we obtain 


AM AL 
PaO | Diig Sea) Se (10) 
Ds dy Ran(O; dn — d,) — mi 
1 1 
The terms R,,,(D; d,, — d,) can be found from equation (8); 
Ran(D; dm = d,) mal Pmn(D; din An) Oo mon + MimMNny (11) 


and 
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m? = (05d «++ dy)ee = CY ral on)) 
b> (rn (0; dad) 
as ae, Ne 


Hence the correlation function of equation (10) becomes, assuming 


Om = On; 


I 


M M 


DS DS pmn(D; dn — dn) 


GAD | Gig gn) (13) 


A 


> Pmn(O; i d,) 


m=1 n=1 


If all spacings between two adjacent antennas are equal, then d,, — 
= (m — n)d, where d, is the distance between two adjacent antennas. 
We may let d = d,, and simplify the notation of equation (13) to 
M MM 


Dc Dp Pad y0) 


p.(D | d) = "#1 ____.. (14) 


M 


3 > Pmn(0; d) 


m=1 n=1 


Equation (14) shows that a normalized autocorrelation function of 
an M-branch predetection combined signal is a normalized extended 
autocorrelation function of second kind in terms of all individual nor- 
malized correlation functions between branches. We notice that 


p(D|d) S p.(0|d) = (15) 
and as stated in Section IT 
Pmn(O; d) = Pmn(d). (16) 


We may also realize that 
pi2(D; d) = p23(D; d) = psa(D; d) = 


and 
pis(D; d) = po(D; d) = p3s(D3d) = --:. (17) 
Hence, equation (14) can be further simplified as 
pAD | d) 


= Mout(M— 1) (p12 p21) + (M1 — 2) (pis + ps) + sss tpi tea 
M pi, +(M —1)(pio+ p21) + (WM —2) (pists) ssp tan ’ 
(18) 
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where Ppmn = Pmn(D; d) and p>,,(0; d) used in equation (18) are for sim- 
plicity (pn. and p>, are derived in Appendix A). If we let the antenna 
spacing d/\ = 0, then p.(D|0) from equation (18) represents the 
correlation function of two single-branch signals 


p(D|0) = Jo(BD) (19) 


which agrees with that in Ref. 6. 

Several numerical calculations have been carried out for the following 
example: Two four-branch diversity receivers, each of them with fixed 
antenna spacing d/\ = 0.5 or d/X = 1.0, are mounted on the roof of 
the mobile unit, as shown in Fig. 1. These two receivers are separated by 
a distance D/\ (D/) varies from 0 to 4) for two cases, a = 0° anda = 
90°. The calculations of the extended correlation function p,(D | d) of 
these two signals, obtained from their respective receivers when the mo- 
bile unit is moving, are shown in Figs. 2 and 3. Both figures indicate the 
values of D/) which give the least correlation between two signals. We 
also note that the correlations at a = 0° are higher than that at a = 90°. 
Figures 2 and 3 can also represent the auto correlation of a signal re- 
ceived from a single four-branch diversity receiver which has its antenna 
spacing d/\ = 0.5 or 1.0 and moves on a street with a constant speed 
V(D = Vt). The power spectrum of such a signal can be obtained by 
taking the Fourier transform of its autocorrelation function. 


3.2 Derivation of the Correlation Function of a Signal Envelope Received 
from a Directional Antenna Array 


Signal reception from a directional antenna array with M antenna 
elements has been also suggested as a means of overcoming multipath 





td be 
WHIP ANTENNA ELEMENT 


. 1—Coordinate system of a M-branch diversity mobile radio receiver 
are = 4 branches). 


3430 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1969 






































ie) 0.5 1.0 1S 2.0 2.5 3.0 3.5 4.0 
D/A - DISTANCE WHICH A FOUR -BRANCH DIVERSITY RECEIVER MOVES 


Fig. 2— Normalized autocorrelation function of a four-branch diversity re- 
celver moving at a = 0°. 


fading in mobile radio propagation.’°~’” The derivation of the correlation 
function of this signal envelope is as follows. 

Suppose that the same kind of signal which consists of multipath 
vertical polarized waves as mentioned in Section 3.1 is received by a 
directional M-antenna array. The M antennas are spaced by d,, dz, 
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D/A - DISTANCE WHICH A FOUR-BRANCH DIVERSITY RECEIVER MOVES 


Fig. 3— Normalized autocorrelation function of a four-branch diversity re- 
ceiver moving at a = 90°. 
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d;, -*+ , dy, respectively from an arbitrary common point. After the 
antenna array is moved by a distance D, (see Fig. 4) the received 
signal envelope « which is the amplitude of the sum of WM individual 
signals can be expressed as” 


(D;d, ,d2,d;3, teats , dy) 
| si(D; d,) + so(D; de) + +++ + sa(D; das) | 


M 


>) sa(De ds) 


1 
| X(D; di , dz , aN »du) + jY(D;d, , de , pee , dar) F (20) 


where s,, is a complex variable which represents the amplitude and the 
phase of an individual signal. X and Y are the real and imaginary parts 
of the total signal. 


I 








If the spacings between adjacent antennas are equal, then antenna m 
and antenna n are separate dyb d, — d, = (m — n)d. Therefore X and 
Y of equation (20) are functions of D and d only. Suppose that all d’s 
are constants, the autocorrelation function of signal envelope e can be 
obtained by using the equation:* 


(X1(0; d)X2(D5 d)yev + (X10; DY (D; dow 


pane (X70; a) 


(21) 
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Fig. 4— Coordinate system of a broadside directional antenna array (M = 8 
elements). 
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provided X and Y are gaussian variables, all (X,,),. are zeros, and 
(X,?),, and (Y?),, are equal, where m = 1 or 2. These facts are shown 
in Appendix B. If the antenna spacing d/A = 0, then p.(D | 0) actually 
represents the correlation between two single-branch signals, which 
agrees with equation (19) and Ref. 6. 

The normalized correlation function of a signal received from a 
broadside directional antenna array is 


Dy De (Jo As) + Jo(Bs) + Jo(4s) + 1B, - 
m=1 n=1 = = ; 22 
|> 2 [Jo(Ao) + 1.89) 


m=1 n=1 


1 
p(D |d) = 4 


where 


k= for M is even 


M 
2 
-#t Me #ore-0ad, 


and A,, B,, Az, and B, are shown in equation (48). A, and By are 
shown in equation (49). 

Several numerical calculations have been carried out for the following 
example: Two eight-element broadside antenna arrays, each of them 
with fixed antenna spacing d/A = 0.5 or d/A = 1.0, are mounted on 
the roof of the mobile unit. These two arrays are separated by a distance 
D/x (D/x varies from 0 to 4) for two cases a = 0° and a = 90°. The 
calculations of the extended correlation function p,(D | d) between two 
signals received from their respective arrays when the mobile unit is 
moving are shown in Figs. 5 and 6. Both figures indicate the values of 
D/ which have the least correlation between two signals. The extended 
correlation curve of d/X = 0.5 is quite different from that d/X = 1.0 in 
both figures. The curve of d/AX = 0.5 in Fig. 5 shows that the high 
correlation and low correlation are about 0.25\ apart; however, this 
phenomenon does not appear for d/X = 0.5, but rather for d/A = 1.0 
in Fig. 6. It can be explained as follows. For the directional antenna 
array with spacing d = )/2, most of the energy is contained in the two 
major broadside lobes, while for the directional antenna array with 
antenna spacing d = \, most of the energy is contained in the two major 
end-fire lobes. As the vehicle moves, strong standing waves may occur 
when the major antenna lobes lie in line with the motion of the vehicle, 
such as for the case a = 0° and d = 4/2; or the case a = 90° andd = 2. 
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Fig. 5— Normalized autocorrelation function of an eight-element directional 
antenna array pointing at a = 0°. 


The autocorrelations obtained from these standing waves, then, be- 
come oscillatory in nature, as we would expect. 

Figures 5 and 6 can also represent the autocorrelation of a signal re- 
ceived from an eight-element broadside antenna array which has its 
antenna spacing d/ = 0.5 or 1.0 and moves on a street with a constant 
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Fig. 6— Normalized autocorrelation function of an eight-element directional 
antenna array pointing at a = 90°. 


3434 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1969 


speed V (D = Vi). The power spectrum of such a signal can be obtained 
by taking the Fourier transform of its autocorrelation function as we 
mentioned in Section 3.1. 


IV. CONCLUSION 


The derivation of a general correlation function of two random 
variables, each of them involving two parameters, has been obtained. 
The terms ‘‘extended correlation function of first kind’”’ and ‘‘extended 
correlation function of second kind’ have been defined. The application 
of the extended correlation function is demonstrated. The correlation 
function of a diversity signal and the correlation function of a directional 
antenna array signal are derived with the help of the extended correlation 
function in this paper. Several numerical calculations have also been 
carried out. From these correlation functions we can obtain the least 
correlations between two signals under certain circumstances. Also, 
we can obtain the power spectra by taking the Fourier transform of 
these correlation functions. Thus, it seems likely that these functions 
will find general application. 
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APPENDIX A 


Finding Normalized Cross Correlation Functions From Individual 
Branch Signals of a Predetection Diversity Combining Receiver 


It is easy to show that the signal from branch m in equation (5) is 


>, A, exp [+ 98D cos (0, — a) + j(m — 1)8d cos 8,] 


= | Xa Pola |; (23) 
where 
Ay = fe Pgdes 
N 
xX, = do Ru cos ¢, + S, sin ¢. , (25) 
N 
Y,, = >) S, cos ¢, — &, sind, , (26) 
u=l 
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—¢, = BD cos (6, — a) + (m — 1) Bd cos 6, (27) 


(R,, and S, are independent gaussian amplitudes with zero mean and 
unit variance). The diversity receiver is located at the point (D, a) in 
polar coordinate. The distance D, the angle a, and the arrival of uth 
wave at angle 6, are shown in Fig. 1. We assume the N waves are uni- 
formly distributed in angle. Now we can average the product of two 
components of two branches—branch m and branch n—as 


(Xn(Dy) XD + D))av 
= (X,,(0)X,(D)) av 
= NE{cos [BD cos (6, — a) — (m — n)@d cos 86,]} 
N[Jo(a)Jo(b) — 252(a)J2(b) + 254(a)J4(d) 
— 2F,(a)Fo(b) + --] 


I 


= NJ(a’ + 0°}, (28) 
where” 
a = BD cosa — (m — n)£d, (29) 
= 6D sin a, (30) 
a’ + b® = (BD)? + (m — n)*(Bd)” — 2(m — n)B*Dd cos a, (81) 
and 
(Xn(Di) VCD: + D))av 
= (X,,(0) Y(D))av 
= NE{sin [6D cos (6, — a) — (m — n)8d cos 8,]} 
= 0. (32) 
Also 
(X Di) = AY Di) ay. (33) 


Substituting equations (28), (82), and (33) into the following equa- 


tion®’** 


(Xm(0; @)X (D5 d))ev + (Xm(0; a) Y(D; d) Yew. 
(X7,(0; d) ev 





Pnn(D; d) = (34) 


Then we obtain the final result 


Pmn(D; d) = J2[(BD)? -+ (m — n)°(BD)? — 2(m — n)*Dd cos al? 
(35) 
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and 
Pmn(d) = pmn(O; d) = Jel(m — n)Bd]. (36) 
We also can show the relations 


Piz = P23 = P34 ; P21 = P32 = Pas 


Piz = Pos = P35 3 P31 = Pae = Ps3 
Pmn 7% Pam for m ¥#n, 
Pmn(D; d) = Pnm(—D; d). 


APPENDIX B 


Finding a Normalized Correlation Function From a Real Part and an 
Imaginary Part of a Signal Received From a Directional Antenna Array 


It is easy to show that a signal consisting of N multipath vertical 
polarized waves received from an equal-spaced directional antenna 
array at a distance D from a reference position is*” 


EAD; d) = pS A, {1 + exp (jy) + exp (j2y) 


+ exp (j3y) + +++ + exp [77 — 1)y¥]} 
-exp (j8D cos 4,), (37) 
where A, was defined in equation (24), 


y = Bd sin (a — 6,) + 4, 

d is antenna spacing between two antennas, 

M is the number of elements, 

a is the normal direction of the array, 

6 is the relative phase between antennas, 

D is the distance measured from the coordinate origin to the center 
position of antenna array. (The center position of the antenna 
array is assumed always on the axis, that is, at the position 
(D, 0).), and 

6, is the angle of arrival of the uth wave and is assumed to be uni- 
formly distributed. 


The coordinate system of a directional antenna array is shown in 
Fig. 6. Since the spacings between antennas are equal, we can let the 
phase refer to the center point of the array. Then equation (37) can be 
simplified by combining the first term and the Mth term, the second 
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term and the (M — 1)th term and so forth.”* The result becomes 


N 
E,(D; d) = >* A, exp (j8D cos 6,) 
u=1 





|: cos (M+ y) + 2 eos (M> 3 y) = a +20 | , 
(38) 
where 


Q=1 if M = odd 


cos (4y) if M = even. 
Equation (38) can be separated into a real part and an imaginary part as 


E,(D;d) = X + jY 


and 
e(D; d) = | £,(D; d) | = (X? + Y’J}, (39) 
where 
K N 
x = 2 oe )> [R, cos (8D cos 6,) — S,sin (BD cos 6,)] 
m=1 u=l 
(M + 1— 2m ) 
“0S. 
2 
K 
=)2 SS ara (40) 
m=) 
K N 
Y =2 >) D> [R., sin (6D cos 6,) + S, cos (BD cos 8,)] 
m=1 ul 
sine (¥ +1—2m v) 
2 
K 
= 2 x, Ym » (41) 
where 
K = a if WM is even 
ge ui ae ae olds (42) 


2 
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Since R, and S, are independent gaussian variables, it is easy to 
realize that all z,, and y,, are independent gaussian variables. Hence 
X and Y are also gaussian variables. The mean values of X and Y are 
zeros, and the mean squares of X and Y are the same. Therefore equa- 
tion (21) can be applied. The following term in equation (21) can be 
replaced by 


oa 


Lm(0; o|| > Lm(D a| 


m=1 


(X,(0; d)X2(D; d))aw = i) 


3 
ll 


> 


> (2,(0; d)Zn(D; d))av - (43) 


n= 


Mer 


+ 


_ 


m= 


The term (X,(0; d) Y.(D; d)) also can be obtained, and is equal to equa- 
tion (48), by replacing x, by y, . Then equation (20) becomes 


p(D/d) = 
[> Y (oal0sdaa(D; Dx | ie |= > (ea(0} Dy.(D 3A) f 
| > & eal; dr; 0). | 


m=1 n=1 





» (44) 





where K is shown in equation (42), and 
(tm(0; d)x,(D; d))w = © (cos (8D cos 0@,)-{cos (U7 +1 — m—n)y] 


+ cos [(m — n)W]} ow 5 


(en(0; d)ys(D; dav = F (Sin (BD cos 8,)- {cos [(M + 1 — m—n)¥] 


- cos [(m — 2) W\})av , (45) 
and 
= Bd sin (a — 0,) + 6. 


Now we may consider only a broadside directional antenna array, 
that is, 6 = 0. Then the following terms can be derived:”” 


(cos (a cos 9,)-cos [b sin (a — 4,)])av 
= 3(cos [((a + bsina)cos #, — b cosasin 0,] 
+ cos [(a — bsina) cos 6, + b cosasin 6,])ay 


= 3[J.(A) + Jo(B)I, (46) 
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where 
A = (a + 2ab sin a + b°)?, 
B = (av — 2absina + 0b’)}, 
(sin (a cos 6,) cos [b sin (a — 6,)]) = 0. (47) 
Inserting the general formulas equation (46) and equation (47) into 


equation (45), it becomes 


(04(0; d)e4(D; doe = [Jo(As) + Fo(Bs) + Jo(As) + JolB.) 


(tn(0; d)yn(D; d))av = 9 (48) 
where 

-) = 6[(D? + 2Dd(M +1—m—n)sine+d(M+1—m—n)*} 

A, 2 : 2 2} 

= B[D° + 2Dd(m — n) sina + d(m — n)’]’. 


From equation (48), we can deduce the results 


(2m(0; d)xn(0; d))av 
= N{J[pdW + 1 — m — n)] + JolBd(m — n)]} 
= N[Jo(Ao) + Jo(Bo)] (49) 
and 
(z,(0; d)) = N{Jo[8dz + 1 — 2m)] + 1}. (50) 


Then substituting equations (48) and (49) into equation (44), we com- 
plete the derivation of a normalized correlation function of a signal 
received from a broadside directional antenna array. 
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Sputtered Glass Waveguide for Integrated Optical Circuits 


By J. E. GOELL and R. D. STANDLEY 


(Manuscript received September 16, 1969) 


A series of papers which appeared in the September 1969 issue of the 
Bell System Technical Journal treated the theory of dielectric wave- 
guides and stressed the potential use of such media for optical communi- 
cation circuits.""* Here we report on the realization of low-loss, thin 
glass films which can be used for circuit fabrication. Methods of pre- 
paring planar films and waveguides having rectangular cross section are 
described along with the techniques used in evaluating their optical 
characteristics. 

The films we used for waveguide fabrication have been prepared by 
RF Sputtering of suitable glasses. The sputtering system used was oil- 
diffusion pumped and had five-inch diameter electrodes. Oxygen was 
used as the sputtering gas. The best films obtained to date were made by 
sputtering Corning 7059 glass. For convenience, in the early stages of 
this work, laboratory slides have been used as substrates. Necessary 
steps were taken to ensure that the substrates were clean. 

The index of refraction of the films was measured to be 1.62 by 
determining Brewster’s Angle for the films as described by Abeles.” From 
the color of the film and by interferometer methods the film thickness 
was found to be about 0.3 um. 

The transmission loss of the films was measured by two methods. 
Both use prisms to launch a light beam into the film.°’’ In method 1 it is 
assumed that the scattering centers in the films are uniformly distrib- 
uted. A fiber optic probe is then used to measure the intensity of the 
light scattered at right angles to the film. In method 2, the intensity of 
the output beam is measured as a function of launcher position along 
the film. Method 2 appears least accurate due to variations in launching 
efficiency as a function of prism movement. Method 1 works well to 
losses of the order of 1 db per cm. Below this level, the variability in the 
strength of the scattering centers makes reliable measurements difficult. 
An increase in film length would partially overcome the difficulty of 
measuring low level scattering from random centers. 
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Fig. 1— Light seattered from a beam propagating in a Corning 7059 glass film. 
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Fig. 2— Relative scattered power versus length (7059 glass film). 
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Fig. 3—Section of a rectangular waveguide (1000). 





Fig. 4— Light propagating in a curved section of rectangular waveguide. 
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Figure 1 is a picture of the light scattered from a beam propagating in 
the film. The intensity of scattered light as measured by the fiber optic 
probe is plotted in Figure 2. The average slope is less than —1 dB/cm. 
This result is in agreement with measurements made by the second 
method. The lack of uniformity of the scattered light intensity is due, at 
least in part, to inhomogeneities in the substrate. By using a higher 
quality substrate this source of scatter can be eliminated. 

Curved sections of rectangular waveguides have been constructed 
from 7059 glass films by back-sputtering using quartz fibers as shadow 
masks. The waveguides were about 0.3 wm thick, 20 um wide, and had a 
radius of curvature of about 4 inch. A photograph of a typical section is 
shown in Figure 3. Figure 4 shows prism-launched light propagating in 
such a waveguide. Due to the small size of the waveguide our instru- 
mentation will have to be improved before loss measurements can be 
made. 

Our initial efforts have demonstrated the feasibility of using sputtered 
glass films and sputter etching in the fabrication of optical waveguides. 
This approach shows promise as a method of producing low-loss optical 
integrated circuits. 

The authors are indebted to W. R. Sinclair for his valuable comments 
regarding the sputtering of glass films and the preparation of substrates, 
and to R. R. Murray who assisted in the preparation of the films and 
waveguides. 
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